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ABSTRACT 

This report consists of a series of investigations on 
problems of the type which arise in the control of switched 
electrical networks. The main results concern the 
controllability and observability of these systems. Subsequent 
work will address itself to the stabilization of these systems, 
building on the basic theory given here. 




System Theory on Group Manifolds and Coset Spaces 

** 

R. W. Brockett 


Abstract 

The purpose of this paper is to study questions regarding con- 
trollability, observability, and realisation theory for a particular 
class of systems for which the state space is a differentiable mani- 
fold which is simultaneously a group or, more generally, a coset 
space. We show that it is possible to give * her explicit expressions 
for the reachable set and the set of indistinguishable states in the 
case of autonomous systems. We also establish a type of state space 
isomorphism theorem. These results parallel, and in part specialise 
to, results available for the familiar case described by &(t) ■ 
Ax(t)+Bu(t); y (t)«Cx(t) . Our objective is to reduce all questions 
about the system to questions about Lie algebras generated from the 
coefficient matrices entering in the description of the system and in 
that way arrive at conditions which are easily visualised and tested. 
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1. Introduction 

A standard assumption in modern control theory is that the state 
space Is z vector space. This assumption is both valid and natural in 
many situations, but there is a significant class of problems for which 
it cannot be made* Typical of these are certain problems which arise 
in the control of the attitude of a rigid body. The state space in 
this case is not a vector space. Linearization often destroys the 
essence of the problem — even if one can work locally — and in any 
case new and different methods are needed for treating global questions. 

In this paper we substitute the following hypothesis for the usual 
vector space assumptions. We let 9 and be matrix groups and study 

V 

X(t) - (A + [ u, (t)B.)X(t) ; y(t) - #X(t) ; X e & 

i-1 

where A and belong to the Lie algebra associated with the u i are 
the controls, and the notation #X(t) is to be Interpreted as being a 
coset in 3F. We also study vector systems of a similar type whereby 
we can view their evolution as occurlng in a coset space. The results 
concern the explicit construction of the readiable set and a character* 
izatlon of observability which is easily tested. Our main point is that 
this class of systems is in many ways not more difficult than linear 
systems of the usual type in ^ff ^ 1 . 

There is a moderately large literature on the use of Chow's results [l] 
and related ideas to study controllability. Including the work of Hermann, 
Kucera, Hermes, Haynes, and Lobry (see [2*6]). This work is relevant here 
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but we are directly Interested In controllability only in-ao-far 
as It contributes to the identification of a framework in which we 
can study a full range of system- theoretic questions, including 
observability and realization theory. Notice that it is impossible 
to pass directly from controllability results to observability results 
in the present set up because there is no clear notion of duality. 

The main motivation for this work came from some work on Lie algebraic 
methods in differential equations (see [7-101) and, above all, from 
being confronted with certain physical problems where linear theory 
was simply inadequate. 

Some unpublished work [l8»19l by Jurdjevlc and Sussaann la 
related to this paper. In particular they give in [19] an alternative 
proof of our theorem 5 and make a serious study of the nnsy— iitrlc 
case (treated only superficially in theorem 7 here). We also mention 
a recent paper by Elliott [20]. 
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2. Examples 

We postpone the development of the subject long enough to present 
a few simple examples which will help justify why the assumptions are 
set up the way they are. 

Example 1 ; (Control Systems Design) Consider the problem of deter- 
mining the gain, k in the system 

x(t) - Ax(t) - k(t)Bcx(t) 

so as to achieve good performance relative to an index of the form 
f| “ I x*(t)Mx(t)dt ; M ■ H* * 0 

>0 

If a particular initial state is chosen and k( ) is selected so as 
to minimize n, then the performance might be bad relative to some other 
initial state. In cases where the initial state is not known It is much 
more realistic to pick a collection of initial state vectors and to 
pick k in such a way as to minimize a weighted average of the individual 
performances. In fact, just to insure stability it is necessary to 
average over at least n linearly independent initial states. If exactly 
n are chosen then k should be regarded as controlling the evolution of 
the matrix equation 

*(t) - (A-k(t)bc)*(t) ; ♦«» - 

The state space is then the space of nonsingular n by n matrices, ®£(n) 
Example 2 : (Rigid body control) The orientation of a rigid body 


relative to some fixed set of axes is described by a 3 by 3 orthogonal 
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matrix A which satisfies the differential equation 


i 12 (t> i 13 (tr 


o u» 3 (t) -u» 2 (tr 

p 

i n (t) i 22 (t) i 23 (t) 

• 

-« 3 (t) 0 w 2 (t) 


,i 3 l<t) i J2 <t) i 33 (t). 


. 0» 2 <t) -w^t) 0 _ 

J 


l ir 


12 ' 


13 


21 22 23 
L*qi <t) ®32^> 


33 


The b)'a themselves are usually controlled via the equations 


^(t) - [(I 2 -I 3 )/I 1 ]w 2 (t)w 3 (t) + n 1 (t)/I 1 
& 2 (t) - + n 2 (t)/I 2 

4> 3 (t) - + n 3 (t)/I 3 

The state space for the first set of equations is 3F(P( 3) — the set of 
3 by 3 orthogonal matrices — the state space for the second set of 

3 

equations is R — cartesian 3-space. For our present purpose suppose 
that the center of mass of the body is fixed and suppose that the 
observed output of this system is a pencil beam of light generated by a 
light source which is mounted in the body along a line passing through 
the center of mass. In this case the output is &X(t) where ? it a 
subgroup which corresponds to a rotation about the pencil beam (an 
undetectable motion) 



Figure 1 : Illustration of the observability of a rigid body. 
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Example 3 : (A model for DC to DC conversion) The electrical network 
shown In figure 1 contains switches which are to be manipulated in 
such a way as to transfer the energy stored on the capacitor 1 to 
capacitor 2. In order to have a sensible physical model we demand 
that there be exactly one path through the Inductor at all times. 



Figure 2 : An electrical network for which energy is conserved. 
The equations of motion are 




0 

s x (t) 0 ' 


■* 1 (tr 

x 2 (t) 

m 

-S^t) 

0 s 2 (t) 


x 2 (t) 



. 0 

-s 2 (t) 0 _ 


,x 3 (t)_ 


where x^ ■ *3 “ *^2 v 3 and x 2 " i 2*^ an<1 8 1 an< * s 2 are ^®P cnd 

on the switch positions and take on the values 1 or zero. Ue have 

s^ • 1 and 82 ■ 0 and if the switch on the left is closed and we have 

X 1 " °» *2 * 1 the switch on the input is closed. 
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3. Lie Algebras and Lie Groups 

Let denote the set of real n by n matrices ; JR*** is a 

vector space of dimension n*. By a Lie algebra SB in P 0 * 10 we under- 
stand a subset of )J^ nXn which is a vector space and which has the 
property that if A and B belong to SB thus so does [A t B] ■ AB-BA. 

If SB^ and ^ are hie algebras in and their intersection 

^1 ^ ^2 is also a Lie Algebra since if A and B belongs to ^ and 
SB 2 a *»d Both are algebras then [A,B] belong to both 2?^ and 9B^, The 
union ^ U SB 2 of two Lie algebras, the sum SP^ + SB^ of two Lie 
algebras and the commutator [SBySf^l of two Lie algebras are not 
necessarily Lie algebras. 

Given an azbltrary subset of *] (^ nxn we can add additional elements 

to it so as to imbed it in a Lie algebra. To obtain the smallest 

Lie algebra which contains a given set .4 r we first add to *A' all linear 

combinations of elements in JP so as to get a real vector space 

Then commute elements in to get if this is 

not contained in then we form etc. Clearly 

this process stops In a finite number of steps since at each stage we 

Increase the dimension of the vector space by at least one and the 

2 

dimension is uppe. bounded by n . We call this Lie algebra the Lie 
algebra generated by A and denote it by 

If .At is a set of nonsingular matrices in ^ nXn we let iAt} r 
denote the multiplicative matrix group generated by *At, l.e. the 
smallest group in 10X11 which contains and is closed under multiplication 
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and inversion. If ./f' is a linear subspace of 7^ ,nXn then the set 

N 1 N 2 N » 

^ ■ {M : M « e e ...e p ; t JV ; p ■ 0,1,2,...} 

contains no singular matrices since det(expN^) « exp(trN^) > 0. 
Clearly M is closed under multiplication and inversion and in our 
notation 


•M ■ {exp^}g 

Let y be a Lie algebra. At each point N in {expi?}^ there is a 
one to one map ^ from a neighborhood of 0 in i? onto a neighborhood 
of M in {exp SP) G which is defined by 

(exp SP} Q ; <f^(L) * eHf 

This map has a smooth inverse which shews that jexpi?fg is a locally 

Euclidean space of dimension equal to the dimension of SP, Ne may 

• 1 00 

check that the maps satisfy the conditions for a C manifold in 

the sense of reference [11] (page 97). Thus we may give (expSP) G the 

structure of a differentiable manifold. This justifies our referring 

to {exp^}^ as a group manifold . 

If tx/ is a linear subspace of 7^ 0X11 which is not necessarily a 

Lie algebra we might inquire as to the relation between {exp«j/}g and 

{expW}^}^. Clearly the latter contains the former. The following 

theorem claims that they are identical. 
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Theorem 1 : Let «V, , . . . , *1 be a collection of linear subapaces of 

12 p 

VT 1 - Then 

{expeV^>exp . . . ,exp«.a^)g - (exp{«V^, 

Before proving Theorem 1 is is appropriate to make a few remarks 
about its relationship to the controllability literature. (Perhaps a 
glance at theorem 5 would help at this point.) In considering equation 

X(t) - ( l u.(t)A ‘X(t) 
i-1 1 x 

as a differential equation in &£(n) it is clear from the theorem of 
Frdbenlus [12] that the solution passing through X e Sl(n) lies in 
(exp {A^ ) A ) G X because at each point X in &£(n), {A^X ) A is an 
involative vector field which contains X and spans the tangent space 
of {exp{A 1 > A } G X at X. Wei and Norman [ 9] confirm this fact by 
giving (Locally valid) formulas for the solution of this differential 
equation in terms of the f unctions u ± ( • ) and the structural constants 
of the Lie algebra generated by the a^ (without pointing out the differ- 
ential geometric Interpretation of the result). On the other hand, if we 

m 

regard X(t) - ( J u.(t)A.)X(t) as a control problem then the natural 
i-1 1 

question is not what manifold contains the solution, but rather what 

set can be attained from a given point, given freedom over the choice 

of (u« ,U m* ... , u ). The results of Chew [1] (see also Hermann [2]) are 

applicable here. Chow showed under a suitable regularity condition that 

the set of points reachable for the vector system i(t) - ? u. (t)f. (*(t)] 

i-1 1 x 

using piecewise constant controls as the same as those points reachable for 
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x(t) - l v.(t)g.[x(t)J 
i-1 1 1 

where {gj (x)} is a basis for the involative distribution generated 
bytf^Cx)}. That is (x)} spam a vector space which includes (f^(x)} 
and is closed under the Lie bracket operation 


[f,g] 


3f 

9x * 


ax 


f 


In our case the Lie bracket of A^X and A^X is [A^,Aj]X. Thus we see 

that for the differential equation in question the reachable set 

Includes {exp {A. }.}_ and the theorem of Frobenius insures that It 
i A G 

Includes nothing more. 

The proof of theorem 1 given below could be shortened considerably 
by the use of these ideas. The reason for preferring the longer proof given 
here Is that It is constructive, it is self-contained (nothing harder 
than the implicit function theorem is used) and it has the merit of proving 
a theorem about n by n matrices using the notation and tools natural 
to that subject. 

Proof : Ue give a proof which relies on an implicit-function theorem 
which, under suitable hypothesis, insures the existence of a solution 
of a equations in 8 < a unknowns. (See reference [13] pages 29-30.) 

We also need the Baker-Hausdorff formula which asserts that 


e At Le~ At - L + [At ,L] + - (At,[At,Ll]+ [At, [At, [At, L] ))+... 

Note that the norm of the (n+l)th term in this series is less than 
| | L | |2 n | | A | | n /nJ so that the series is majorized by the stories 
||L||[1«||A|| + <2 ||a||> 2 /2!+...] - ||L||-. 2 H At ll 


and hence is 



V, - 


absolutely and uniformly convergent on -T < t < T for all T. 

Let {A 1 ,A 2 ,...,A n > C U ^ U ... be a basis for 
,V + a/ and let i? be the Lie algebra generated by 

1 2 p 

{A,, A ,A >. Assume this algebra is of dimension q. There 

Id- r 

exists a basis for i£ which consists of terms of the form 


L 


1 


■ A 


1 


H mk i 


L rH “ ^(r+D^Krfl) 1 
L rf2 “ 

L t+s * {A k(rfs)’ A l(r4m) 

L r+s+l " (A k(rWl)^ A l(rff+l) ,A «(HtH) J1 

L q " ^••*^ A k(q) A m(q)l***l 

We are quite explicit here because at certain points In our proof It 
is necessary to regard these expressions as formal expressions as 
opposed to matrices* 

We introduce the following special notation. The operator EXP 
maps formal expressions Into matrices* It Is defined on A^ end Its 
conuutators (l.e. formal expressions such as A^.lA^.A^], [A^Uj^]) etc.) 
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as follows 


EXP A £ t 


V 


EXP[A 1 »Aj ]t - 


A./t A ft -A./t -A /t 
e e J e*e J t * 0 

A /ft! A/ftf -A/ftf -A/ftf 
e J e e J e t<0 


The definition Is completed by recursion. If B Is a commutator 
expression then 


EXP[A 1# B]t 

and 

EXP[B,A t ]t 



A Vt -A Jt . 

e (EXP B/t)e 1 (EXP B/t) 

_ _ A /ftf -A /ftf 

EXP B/ftfe 1 ^EXP B/ftf)'^ 1 


A./t . -A.vT 

(EXP B/t)e (EXP B/t) e 1 

A./| 1 1 -A /ftf , 

e 1 (EXP B/ftf)e 1 (EXP B/ftf)” 1 


t > 0 
t < 0 

t * 0 
t < 0 


It Is an easy calculation to verify that EXP[A^,A^]t • I+(A^,Aj ]t+o(t) 
where o(t)/t goes to zero as t goes to zero. Ve now show that in 
general if B is a commutator expression then 


EXP Bt * I + Bt + o(t) 


To carry out this proof we use induction. Assume that the result 
is true for commutator expressions B with n-1 brackets we will show 
it true for those containing n brackets. We write B as B ■ [A,C] with 
C having n-1 brackets. (Similar calculations cover the case B ■ [C,A].) 
By the induction hypothesis 


EXP Ct - I + Ct + o(t) 
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Of course EXP Ct can be expanded in a convergent power series 

Involving fractional powers of t. If G(t) denotes the terns 

1 2 

involving powers of t between t and t and if F is the coefficient 
2 

of t then 

EXP Ct - (I+Ct4C(t)+Ft 2 +o(t 2 )) 

The power series expansion for the inverse of EXP Ct is then 

(EXP Ct)" 1 - I-Ct-G(t)+(C 2 -F)t 2 +o(t 2 )) 

as is verified by multiplication with the expression for EXP Ct 
itself. Now for t nonnegative 

EXP Bt - EXP[A,C]t 

- (I+Ai/frfA 2 t/2+o(t)) (I+C*^H»(*^t)4Ft+o(t)) 

(I-A^t+A 2 t/2fo( t) ) (I-C/t-G(i^t)+C 2 -F) t+o( t) ) 

- I+[A,C) t+Ft+A 2 t/2+A 2 t/2-A 2 t+(C 2 -F) t-C 2 t+o( t) 

- I+lA # C)t+o(t) 

and the case t < 0 leads to the same result. 

Hell known properties of the matrix exponential function let 
one conclude that for |t| >0 EXP Bt is continuously differentiable. 
The above argument shows that EXP Bt is differentiable with respect 
to t in a neighborhood of t * 0 and 

■4 EXP It I - B 
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Hence we have for the basis elements L , 


-r 1 EXP L t - L 

dt v l t -0 u 

Now consider a function of u * (u, ,u„,. . . ,u ) and v ■ (V t ,v«, . . . ,v ) 

12 q 12 q 

which maps 7^ x 7R^ Into 7 R IUCn and which is defined by 

F(u,v) - -I+(EXP L.u.) (EXP L 0 u~). ..(EXP L u ) e " L l Vl ' L2V2 * * 

11 ^ z q q 


Clearly F(0,0) » 0. Now the linear approximation of F at (u,v) ■ 
(0,0) Is given by 


F (u,v) (u>v) 1(0.0) (Su,6v) ■ L 1 5 u 1 +L 2 ,5u 2 + - • -VVVVVV" -- L q 6 * q 

so that the range space of F^ ^(u.v)^ ^( u,0) is the q-dimensional 
subspace of R nxn spanned by {L^}. Now F(u t v) + I is a finite product 
of exponentials which we write as 


P 1 p 2 

A i V A i u i 

‘l *1 x 2 l 2 

F(u,v)+I - e A e c z ... 


\\ -h’rtY- 

e e 


L v 
<1 


Since the Baker-Hausdorf f formula lets one write 
A.t -A.t 

6 1 A^e m \ + + ••• 

A.t -A.t 

we see that e A^e belongs to the Lie algebra generated by the 
A’s. Moreover, it is continuous with respect to t and at t » 0 
takes on the value A^. Using this result repeatedly we see that 
for each (u. } we can find R. in £? such that 
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P 1 p 2 

A i U i A i V 
X 1 X 1 x 2 x 2 
e e 


'k 


? kfl 


\\ V\ + i \\HWr-V , 

»6 A, e • • • © 6 

Tc 


(F(u,v)+I)R (u ,u ...u ) 

X k X 1 X 2 \ 


simply by pushing past the exponentials one at a time. Clearly 

R. (0,0,...,) ■ A . Thus we see that for a and b small 
X k x k 

F, V \( u * v >| (6u,6v) - (F(a,b)+I)( J 6 1 (a,b)6u +H (a,b)6v ) 

lu * V ' I (a,b) i-1 11 1 


for some S^(a,b) and R^(a,b) in i?. Since S^(0,0) * H^(0,0) » 
and since and depend continuously on their arguments this 
establishes that the Jacobian of the map F : 7*?^ x + 7 ^ um 
must have rank q in a neighborhood of (0,0) and hence by the implicit 
function theorem cited earlier there exists an e > 0 and a map 
4> : R** -*• R* 1 such that if | |vj J < e then 


F($(v),v) - 0 


Since F(u,v) ■ 0 implies that 


Li v i+l 2 v 2 +...+ l q » )) 


EXP L.u.EXP L-U-...EXP L u « e 
ll q q 


We conclude that there exists > 0 such that If L e 


and | |L| | < we can write 


A i u i A i u i A i u i 
L X 1 X 1 l 2 x 2 _ x v x v 
6*6 6 »••« 


Mow for any L t SB It follow that 1 L| | < for some Integer 


and thus we can express L as expL/m*expL/m.» .expL/a. Likewise we 
L 1 L 2 L o 

can express e e ...e H in this form. 
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Let M and <f be Lie algebras in 'TF ** Xi • It can happen that 
{exp Jtf) G is a bounded subset of /fi* 13 ” 1 which is not dosed, and it 
can happen that the closure of {expjC}^ equals {expi^}^ with M t SB . 

The skew- line on the torus [11] is an easy example. Also, {expJif}^ 
is not necessarily simply connected. Nonetheless, we have the following 
result which we deduce from theorem 1 rather than sending the reader 
to the literature. 

Corollary 1 : If C/C and SB are Lie algebras in uf*** then (exp X) G 
{exp ?'}q if and only if Md ( -£ and {exp,/C} G ■ {exp £) G if and only if 

JC - <£. 

Proof : For both statements the sufficiency is obvious. To establish 
necessity in the first case notice that if {expJ^) G CZ {exp SB)q then 
by theorem 1 

{expfX) G » {expjf, exp SB} Q * {exp{Jf, ^) A > G 

Suppose SB is of dimension n. To obtain a proof by contradiction, 

suppose that C/C is not contained in SB* Then {X t j 2?}^ is of dimension 

n+1 or greater. Then exp 3? is an n dimensional manifold and 

{exp{jf, £}} is not which contradicts {expi?'} 3 {expUB To 

establish necessity in the second case repeat this argument verbatum 

but with "contained in" replaced by "equals" both verbally and symbolically. 

The notation ad^B ■ B, ad^B ■ [A,B], ad^l ■ [A,[A,B]] etc. is 
standard. If C/C and SB are Lie algebras we use the notation {ad^ 7 ^') A 
to denote the Lie algebra generated by C/i under commutation with 
elements of !£. That is 

{ady,«Nf} A - {^,[« , ,.>jr],...,[y,[v , ...[i? — > A 

This algebra may also be described as the intersection of all Lie 
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algebras which contain Jfc and are closed under commutation with 
If 3' and ( S are groups we Introduce an analogous notation. The 
smallest group which contains & and all products of the type GFG * 
for G in $ and F in ^ will be denoted by {AD^^Oq* This group 
may be described as the Intersection of all groups which contain 3 
and are closed under conjugation with elements of 8 . If & is 
{exp ) G and 3 Is {expjtf ) G then clearly {AD^ &}q consists of products 
of terms of the form 


II L l K 1 - L 1 L 2 K 2 - L 2 

M » e e e eee 


L K 
m i 

e e 



Theorem 2: Let and <£ be Lie algebras in'7? lxn . Then 


{ u 

Me(expi?V 




and 


^ AD {exp5?> G ^ eXp ^G^G " 

Proof : From the Baker-Hausdorf f formula we see at once that If L belongs to 

3E and K belongs to Cf( then eSce” 1 ’ belongs to {ad^J^}^ Thus the right side 

of the first equality in question contains the left. On the other hand, 
expressions of the following type belong to the left side. 
e aL £ Ke" aL - — K * lL,K] + o(a) 

S^C £ [K.L]+o(o))e“ C,L - i [K,L]+o(a)-{L{L,Kn+o(a) 


etc. 


Since is a finite dimensional space and since 
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{ U 

Me {exp r £}^ 

Is a linear subspace, it is closed. Thus [L,K] [L[L,K]]... etc. 
belongs to this set and the first equality Is seen to hold. 

The second statement Is obtained by exponentiating the first. 

This gives 

{exp{ ^ M2?M 1 }.} r - (exp{ad 

Me<e*pS?) G AC ? K 

but since e^e^e ^ ■ exp (.** K e we see that 
so the result follows. 

The next theorem states a purely group theoretic result which 
although easily proven. Is stated formally because we need It In cur 
study of observability. 

Theorem 3 : Let 8€ and 8 be subgroups of a group 8* Let 8 be the 
subset of 8 defined as 

8 - {P : RPR -1 c 8e ; all R e 81} 

Then 8 Is a subgroup of 818* Is a subgroup of <8 and 8 Is a 
normal subgroup of 88. Thus 8 0 8 Is a normal subgroup of 8 and 
8818 Is Isomorphic with 818 f) 8 . 

Proof : Suppose and P 2 belong to 8 then for each R in 8 there exists 
H X (R) and H 2 (R) in H such that RP 1 R" 1 (RP” 1 R" 1 ) - Hj(R)* [H 2 (R 2 ) J" 1 . Since 
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'/£ < Is a group this means RP^Pj^R 1 belongs to 8t and thus tY it 8 is 
a subgroup of . Clearly it is a subgroup of 8C since the choice 
R • I is possible. To see that 818 is a group, note that if and 
R2 belong to 8t and and Pj belong to 8 then 

R 1 P 1* R 2 P 2* " R 1 P 1 R 2 R 2 P 2 R 2 " R l*2 ^ R 2 P 1 R 2 ^ R 2 P 2 R 2 * 


Since 81 is a group and since 8 is a group which has the property th»l 
if P belongs to 8* then so does RPR 1 for each R in 81, we see that 
this product belongs to 818 • Clearly 8 * ia a normal subgroup of 818 * 
since RP^P^R -1 - 8 for each RP in 88, By the second isomorphism 
theorem (Rotman [ 14] , page 26 ) 8t f\ 8 is normal in 8t and 
81818 » 8t!8t O 8 . 


V* now state and prove a Lie algebraic analog of this theorem. 
Algebraic tests for observability will be derived froia this result. 
Theorem 4 : Let Jtf.and 8 be Lie algebras in JR™ . •- 8 be defined 

as 

8 - {P : RPR” 1 e {expJt?} G , all R £ (expi?} G 


If 8C is a Lie algebra in then (expJ£ }^^8 if and only if 

{ad^,«#}^cr jjf. There exists a unique Lie algebra 8C^ such that 
{ad^#}^}^ Jjf and 8i^ contains all other Lie algebras having this 
property. 

Proof : Suppose {a Then for L^ in St and inJ£ we see 
from theorem 2 that {expJtf) G contains 


RPR 


-1 


L. L 


L K. K 


K -L 


“L. -L, 


12 pl2 q p _ 2 

e e ...e e e ...e e r ...e e 


By the hypothesis and corollary 1 {exp{ad^j^} A ) G CZ {expj^/g. 
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On the other hand. If for all In .5? and all In ve have 


RPR 


-1 


l X L 2 


L- L K K 0 N -1* -L, -Li. 

e . ..e **e e . ..e **e ^...e e € {exp«#l 


K -L 


-L„ -L, 


e 


then since {exp.J/’Jg is a group we see that (ADr^^. j {expJiOg }g CZ 

” G 

{exp/T} ( . and again from theorem 2 and corollary 1 we see that 

{ad^) A c 


Finally, notice that if {ad^, # ^ k <Z # and {ad^JT^C^ 
then Jf thus there is a largest Lie algebra with 

this property. 
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4. Controllability on Group Manifolds 

The first question of a system theoretic character which we 
investigate is that of controllability. Since we want to emphasize 
global results we work with the most elementary type of evolution 
equation appropriate to our present steelng, namely 


X(t) - (A + Iu i (t)B 1 )X(t) 

The choice of control affects the direction in which X moves. How- 
ever A is a constant over which there is no control. This evolution 
equation has the property that the change of variables X XP for 
P nonsingular, leaves the equation unchanged. This invariance gives 
the vector field which a given choice of (u^ (t) } establishes on 
G£(n) a particular simple form. 

Theorem 5 : Consider the linear dynamical system 


X(t) - ( l u (t)B )X(t) 
i-1 1 1 


X ■ n by n matrix 


Given a time t # > 0 and given two nonsingular matrices X^ and Xg, 
there exists piecewise continuous controls which steer the state 
from X^ at t ■ 0 to Xj at t ■ t ft if and only if X£X^ belongs to 

{ * Xp{B l , A } G- 

Proof : (Sufficiency) Theorem 1 asserts that any matrix M in 
(exp{B^) A ) G can be written as a finite product, say 


®i °1 B i “2 B i “■ 
*1 l 2 l m 

6 6 a • *6 



M. Divide the interval 0 < t < t fl up into m equal 
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-1 

Intervals whereby t^ * i-t^/m. Let t a /m ■ 8 . On 

the interval [0 y t.) all controls are zero except the 1 th control 

1 ID 

which takes on the value a B* On the Interval [t, ,t.) all controls 
are zero except the i^jth which takes on the value etc. 

down to the last Interval on which all controls are zero except 
the IjSt which takes on the value otjB. Since the differential 
equation is linear and constant on each of the sub Intervals the solution 
is a product of exponentials and the result follows. 

(Necessity) To show that X. cannot be reached from X. unless X«X~* is 

L L . L, 1 z 1 

s m-1 X 

of the form e e ...e we assume the contraty and obtain a con- 
tradiction. Suppose that u^(*),... t u(») is a control which steers the 
system from X^ at t • 0 to Xj at t ■ t ft by theorem 1 of [9] we know 
that there exists a sequence of times t y t.,t 0 ,...,t such that on 

O 1 4 IB 

each of the sub intervals [t. ,t^ + ^] the transition matrix of 

X(t) - ( f u. (t)B.)x(t) 
i-1 x x 

H.(t) 

can be written as e for some H^( ) in SB . Thus we can write 

L L , L, 

X(t ) - e m e ®“...e a 
& o 

which establishes the contradiction. 

As an application which emphasizes the ease w' which we can 

study global questions using this theorem we observe the following 

results relating to the classical groups. Here J is given by 



and a matrix is called symplectic if 8*J0 ■ J. 
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Theorem 6 : Consider the system of Theorem 5. Given a time 
> 0 and given two nonsingular n by n matrices Xj and X2 with 
det XjX 2 > 0, there exists a piecewise continuous control which 
steers the state from X^ at t • 0 to X 2 at t * t # if {B^}^ 

i) spans '7/? ,uai 

ii) spans the n^-1 dimensional subspace of TR™*** consisting 
of the zero trace matrices and det X^ ■ detX 2 . 
ill) spans the n(n+l)/2 dimensional subset of J? 0 * 0 consisting 
of the set of matrices which satisfy JA+A'J » 0 and 
is symplectlc 

iv) spans the n(n-l)/2 dimensional subset of ^ nxn consisting 
of all ske w- sym metric matrices and X^X^ is orthogonal. 

Proof : As is well known any nonsingular matrix can be written as OR 
with 8*6 • I and R * R* >0. Also real orthogonal matrices with 
positive determinants and real symmetric positive definite matrices 
have real logarithms. Moreover in case ill) the factors in the polar 
representation inherit the property of the group itself, which is to 
say that the and R in the polar representation of a symplectlc matrix 
are symplectlc. To complete the proof we need only Invoke theorem 5 

-1 Q s 

since the previous remarks justify our writing XjX ■ e e with 
ft * -(2* and S - S* both in the appropriate Lie algebras. 

The results of theorems 5 and 6 are somewhat unsatisfactory in that 
the A term is absent. The following theorem describes one way in 
which this can be relaxed. 
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Theorem 7 : Consider the linear dynamical system 


X(t) - (A + l u (t)B )X(t) ; X - n by n matrix 

i-1 


Suppose that [adVB ,B ] ■ 0 for i, J-l, 2,...v and k-0, 1, ...n -1. 

a l j 

Let be the linear subspace of R nxn spanned by adjtB^ for 1-1, 2,... v 

2 

and k*0, 1, ...n -1. Then given a time t >0 and two n by n matrices 

3 

X^ and there exist continuous controls which steer the system from 
the state X^ at t*0 to the state X^ at t ■ t fl if and only if there 
exists H in '/£ such that 


X 2 - e Ata e H X 1 


Proof : First of all, notice that 


d k_1 .At, 


-At 


±-r [e At B.e' At ,Bj - “j— y te~{A.B )e ~, ] 

dt k 1 i t-0 dt k 1 1 i |t-0 


^5 I* At (4i) e ' At -V 


t-0 


- [e At (ad k B )c' At ,B ] | 

Ai J |t-0 

' [ad X' B j 1 

Thus le^B^e”**,®. ] is identically zero if [ad^B^,Bj] - 0 for k-0, 1,2,.... 

2 

However, ad^ is a linear operator from an n dimensional space 

into itself so that by the Cayley-Hamilton theorem all powers above 

2 2 
n -1 are linearly dependent on the first n -1. Thus under the hypothesis 
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of the theorem statement te At B 1 e" At ,Bj] vanishes identically. Also 
0 - e At B 1 e* At B j -B J e At B 1 e“ At 

- e Ao (e At B 1 e' At )B j e" A0 -e A0 B J e At B 1 e" At e‘ Aa 
Now let t+o - $ and y * o. Thus for all 6 and y 



For the purpose of solving the differential equation we introduce 
Z(t) * e* At X(t) and observe that 

Z(t) - ( l u. (t)." At B.e At )Z(t) 

1-1 

But recall (see e.g. Martin [15]) that the solution of Z(t)«B(t)Z(t) 

is exp I B(o)d(J if [B(t) »B(o) ] vanishes for all t and o. Thus we 
'0 

can write 

Z(t) - e*p<r I u.<t)«' At B.e At dt)Z(0) 

J 0 1-1 1 

It is a well known and frequently used fact (e.g. [16] page 79) 

that the image space of the map taking continuous functions into 

1 Act 

e bu(o)do, is spanned by 

J 

the first p derivatives of e At b evaluated at zero. Using this fact 
here we see that for each H in and each t > 0 we have a continuous 

o 

u defined on [0, t^] such that 

Z(t a ) - e H Z(0) 


Tf? according to the rule x ■ L(u) 


I 



- 25 - 


Therefore in terms of X we see that we can reach at t fl using X which 

At H 

can be expressed as e a e x(0) with H in 

As an application of this result we derive a familiar relationship. 
Example 4 : Consider the system in 

m 

x(t) • Ax(t) + l b u.(t) ; x(0) - given 

i-1 

Related to this is the matrix system in 


X(t) - £ 0 0 J X(t) + ^ ^(t) X(t) 


Let A and be the matrices appearing in this expression. In this 
It 

case [Ad^(Bj),Bj] vanishes as required and so the reachable set 
from x(0) * I is 


o](exp H ; H £ /} 


where ,'Jf is the subspace spanned by Ad^. A computation gives 

-C *>] 


so that the reachable set at t is 


r« At H 1 n-L 

M - {X : X • I I ; He Range, B,AB,...A b} . 

L 0 lj 

where we have used the fact that e^H * H for all t and all H in 
Range (B ,AB , . . . , A ^B) . 
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5. Observability 

In order to get a theory having a scope comparable to linear theory, 
it Is necessary to treat observability. The choice of an appropriate 
form of the observational equation is critical for the success of the 
overall theory. As it turns out, the natural choice is indicated by 
the second example in section 2. 

Let & be a matrix group and let ^ be a subgroup. Consider the 
system evolving in & 

v 

X(t) - (A + l u.(t)B.)X(t) ; y(t) -«X(t) 

1-1 1 1 

by which we mean that instead of observing X(t) directly, we observe 
what equivalence class X(t) belongs to with respect to the equivalence 
relation in defined by . Thus y(t) takes on values in the coset 
space #7# which is generally not a group manifold, (see section 7) 

We call two states X^ and X 2 distinguishable if there exists some 
control which gives rise to different outputs for the two starting 
states. In general the zero control is not adequate to distinguish 
between all states which are distinguishable as contrasted with the 
situation one finds for linear systems. 

Theorem 8 : Let f be a matrix group and suppose that the set of 

points .‘W reachable from the identity for the system 

v 

X(t) - (A+ l u (t)B.)X(t) ; y(t) - tfX(t) 

i-1 1 1 

is a group. Then the set of initial states which are indistinguishable 
from the identity is given by 
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.? - {P : RPR -1 e 'g for all R E .f } 

& is a normal subgroup of and a subgroup of c £. 

Proof : Suppose that X is a starting state for the given equation which 
is indistinguishable from the identity. That means that for each R 
in < J /? there is C(R) in ^ such that 

C(R)RX - R 

since and ^ are groups we can take inverses to get 

R X R" 1 E # 

Thus the set & is exactly those states indistinguishable from the 
Identity. The remainder of the conclusions come from theorem 3. 

Theorem 9 : Let and & be Lie algebras in and suppose that 

all the points reachable from the Identity for 

v 

X(t) » (A + l u. (t)B.)X(t) ; y(t) ■ {exp.Jtf'Lx(t) 

i-1 1 1 G 

is {exp ( 4 } G * Then the set of initial states & which are indistinguish- 
able from the identity contains {exp«7H r if and only if (ad «#} . C yC . 
Therefore a necessary condition for all states to be distinguishable 
from the identity is that J contains no subalgebra J X such that 

{ad^jtf } A d ,ye. 

Proof : Theorem 8 gives a characterization of ^ which permits one to 
bring to bear theorem 4. Theorem 4 immediately gives the desired result. 

One might be tempted to conclude that if there is no nontrivial 
algebra meeting the requirements of theorem 9 then all initial states 
are distinguishable. This is not true because & can be a discrete 
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subgroup and hence not trivial and yet not expressible as {expJOg 
for any Lie algebra X. The next example illustrates this. 

Example 5 . In the numerical integration of the equations of motion 
of a rigid body one usually avoids Euler angle representation and 
uses instead quaternion or direction cosine representations. As is 
well known, the group of unit quaternions covers twice. 

This causes an ambiguity in going from <^(3) to the group of unit 
quaternions. This example illustrates this idea. Consider an 
equation in the group of unit quaternions 2 which we parameterize 
in the usual way (a^4b^fc^+d^ ■ 1.) 



where is the subgroup given by 


r/» 

e> 


* {expJ*0 G ; 


•o 

-a 

0 


a 0 0 
0 0 0 
0 0a 


L0 


0 -a 0J 
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Now it is true that {exp includes I and -I and it 
that this pair of elements form a normal subgroup of 
as an initial state cannot be distinguished from -I. 
no nontrivial Lie algebra Ji such that {ad^o£}^C .'tf 


is also true 
Thus I 

Yet there is 
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6. Realization Theory 

Cne of the central results in linear system theory is the fact 
that any two time invariant, controllable and observable realizations 
of a given time- invariant input-output map are related to each other 
in a very simple way. , ur purpose here is to establish a similar 
theorem in this context. 

Suppose we have two systems 
* 

X(t) - ( l u i (t)B i )X(t) ; y(t) «#X(t) 
m 

z(t) - ( l u.(t)G.)Z(t) ; y(t) t) 

i-1 1 

We assume that 1) the systems are observable in the sense that no two initial 
states give rise to the same response y for all piecewise continuous 
inputs, and 11) that there exist one to ore maps say c(‘) and h(*) both 
mapping into a set S such that if each system starts at the identity 
state and if each system receives the same lnput > c^K(t)) * h(Jfc'z(t)) 
for all future time. A pair of systems me ^ng these criteria will 
be said to be observable realizations of tho same input-output map . 

We emphasize that X(t) and Z(t) are squ're matrices but not necessarily 
of the same dimension. 

Suppose we have two observable realizations of the same input- 
output map. Let u(-) be a nonzero piecewise constant control defined 
on [0,1] which when applied to the X system takes the state X(0) ■ I 
into the sta a X(l) - 1. Then of course it must do the same for the 
Z system because they are observable realizations of the same input- 



31 - 


output map. Thus we see that if • • • *1^ is a collection of 

integers with 1 i i fc < m and if are any real numbers such that 

a B a.B a B 

1 h 2 l 2 q 4 q _ 

e e . . .e * I 


then 


a.G J a.G a G. 

1 h 2 h 


e e 


• • • € 


Let L.,L.,...,L be a set of commutator expressions in B. , B~,...,B 
l z r 1 z tn 

such tha' {L ± } forms a basis for {b^} a . Let K^,K 2 »...,K r be in an 
analogous expression obtained by replacing B^ by G^, B 2 by Gj, etc. 

Let S be an arbitrary commutator expression in B. ,B., . . . ,B and let 
T be the analogous commutator expression in G^.G^, . . . ,G m . Then in the 
notation of the proof of theorem 1, there exists differentiable functions 
c^Cp) such that for [p| small 


EXPa 1 (p)L 1 EXP a 2 (p)L 2 ... EXP a v (p)L r » EXP pS 


EXPo^Cp)!^ EXPa 2 (p)K 2 ... EXPa(p)K r - EXPpT 
Since the are differentiable we can write (prime denotes derivative) 


I + p l a'(0)L + o(p ) « I + pS + o(p ) 

i-1 


I + p l a'(0)K + o(p 2 ) - I + pT + o(p Z ) 
i-1 


Thus if 


s - l YjLi 

i«l 


then 
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T * l Y,K t 

i-1 1 

From this we see that the algebra {G^) a is generated from {B^} 
in exactly the same way as the algebra {B^) a is generated from 
{B^} and thus that the algebras are isomorphic. We summarize 
this discussion with a theorem. 

Theorem 10 : Consider the two systems 


X(t) - ( l u. (t)B.X(t) ) ; y(t) - S?X(t) 

i-1 

Z(t) - ( l u,(t)G z(t)> ; y(t) -^(t) 

i-1 1 


where X and Z are n by n and q by q respectively. Suppose that these 
systems are observable realisations of the same input output map. 

Then {B^^ and {G^) a are isomorphic as Lie algebras and moreover if 
Lj,L 2 »...,L r are commutator expressions in (b^) which form a basis 
for {B 1 > a and if are the analogous expressions in G 

obtained by replacing B^ by then is a basis for 

and if 



Then 


|L i’V * Y Uk L k 

l W * j x 

Of course this does not mean that the reachable sets from I namely (exp{B.}.}_ 

1 A G 


and {exp{G^} A }g are Isomorphic as groups. For example the group of unit 

quaternions and the group of 3 by 3 orthogonal matrices have isomorphic 
Lie algebras yet they are not isomorphic as groups. 
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7. System Theory on Coset Spaces 

In this section ve reinterpret our results in a somewhat 
different way. This interpretation leads to some facts about 
systems on manifolds which do not admit a group structure. In 
particular we have in mind the n-sphere S n • {x:x'x*l, x € 77^ + ~} 
which, as is well known, does not admit a Lie group structure 
except for the cases n»l and 3. 

Let M C- 7^ n be a manifold. Let ( g be a matrix group in 7^ ,nXn . 

We say that ( g acts on M if for every x e M and every G z g , Gx 
belongs to M. By the orbit of g through x we mean the set of points 
•gx * {y:y-Gx, G e g). We say that g acts transitively on M if it 
acts on M and if for every pair of points x, y in M, there exists G 
in g such that Gx * y. If ( & acts transitively on M then at any 
point x e M there will be a subset such that for each 

H £ , Hx - x. Clearly if ^ e yf^ and H 2 e the- H^x - 

H 1 x ■ x and H ^x * x so that .'sf is a subgroup. We call .yf the 
isotropy group at x. Notice tha„ if Gx ■ y then y « G.j^x • 

G.'/f G ^y and thus G yf G * is the isotropy group at y — all iso- 
tropy groups are conjugate in g . Now suppose M is a manifold for 
which there actually exists a group <3 acting transitively. Pick a 
point x e M. Define in g an equivalence relation whereby G^ " G 2 
if and only if G. - G-H for some H z .yf . There is a one to one 
correspondence between this space of equivalence classes, gift , 
and M. In this case we call M a coset space . 

We study systems in which the state is represented as an n-vector 
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and the evolution is governed by 

v 

x(t) » (A + l u. (t)B.)x(t) ; y(t) - #x(t) (*) 

i-1 

By *<^x(t) we mean an equivalence class of vectors, x^ being equivalent 
to X 2 if and only if Cx. - X 2 for some C in 

Let £ be the • . ^.’gebra generated by {A.B^} and let M c I be 

a manifold such that {expi^L. acts on it. Then the above equation can 
be thought of as evolving on the manifold M c: TJ^ for if x(0) e M 
then regardless of the control, x(t) e M for all t > 0. If there exists 
a differentiable manifold MC such that {ex P&}q acts on M then 
we will say that (*) is well posed on M. 

Example 6 : Consider the n-sphere, S n . Let B^jBj* • • • *B m be n+1 by 
n+1 skew symmetric matrices. Clearly the system 

m 

x(t) - [ l u. (t)B ]x(t) ; y(t) - 

i-1 

is well posed on S n since (exp3?}g consists of orthogonal matrices 
and orthogonal transformations preserve norm. If we can observe only 
the first component of x then we should let be the subsets of 

<%(n+l) consisting of those matrices which have a 1 in the first 
column and first row. That is 





0 " 
■ C M n)_ 


With respect to .controllability we can say given any two vectors 
x^ and X 2 in there exists a piecewise continuous control which 
steers the system from x^ to X 2 if and only if x^ - Rx^ for some R in 
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{exp '/ }q where is the Lie algebra generated by {A^}. Also, an 
arbitrary point can be transferred to an arbitrary point if and only 
if (exp '/ ) G acts transitively on S n . 

At the same time we might observe that any x q such that | |x q | | » 1 
can be transferred to any x^ such that ||x^| | » 1 if and only if 
{exp <£} acts transitively on S n . This second point of view is useful 
because it puts the problem of controllability on S n in contact with 
standard results in geometry. In particular a great deal is known 
about Lie groups which act transivitely on S n . [Samelson [17] page 26]. 

As for observability, we note that two initial states x^ and x^ 
in S n give rife to the same y if and only if for all R in {expi^Jg 
there exists C(R) in such that Rx^ * C(R)Rx 2 which is to say that 
R _1 C(R)Rx 2 - 

We now abstract from this example the essential features and 
state formally a result which summarizes the development. 

Theorem 11 : Consider the dynamical system (x(t) £ 77^*) 

*(t) - ( 2 u. (t)B )x(t) ; y(t) • {expjfc^ r x(t) 

i-1 1 1 G 

which is well posed on the manifold M c: Z^ n . Let 2? be the Lie algebra 

generated by {B^}. A given state x^ is reachable from x^ if and only 

if x 2 ■ Nx^ for some N in {exp^ 7 }^. Let & ■ {P:RPR * c {expJfc}^ 

all R £ expV )}. Two states x^ and x 2 are indistinguishable if and 

only if x 2 * Px^ for some P in In particular, two states x^ 

and x 2 are indistinguishable if x 2 ■ Px^ for P in {expJ^J^ With 

being any Lie algebra such that {ad 
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Example 7 : Consider the submanifold M of consisting of those 

points whose last coordinate is 1. The evolution equation in M 


d rx(t)"| TA Bu(t)“|pc“ 

Ho oJU : 



corresponds to the more familiar x(t) ■ Ax(t) + Bu(t) ; y(t) • Cx(t). 
Using theorem 7 we see that for the associated group equation 


X(t) 



m 


+ l 

i-1 



X(t) 


the reachable set at time t consists of those matrices which can be 
written as 


.ft - {X : X 



:]• 


x e range (B, AB,...,A n ^B)) 


Thus if B,AB,...,A n ^B spans 7$ then the reachable group acts 
transitively on M and we have controllability. 

As for observability, we note that 


( 0 KerC \ /I KerC \ 

0 0 / '0 1 / 

The subalgebsas of W which are closed under commutation with <£ 
correspond to the linear subspaces of KerC which are invariant under A. 
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The Control Problem x = (A(l-u)+Bu)sc : A Comment on an Article by J. Kucera 

★ 

Hector J . Sussmann 

In an article recently published in this journal ([1])» J* Kucera 
studied the control problem 3 ■ (A(l-jj) + Bu)x. The main results of [1] 
are that the set «V(u),T) of points attainable at time T > 0 from a fixed 
point 0 ) is an "integral manifold of the distribution ^(A^B)" ([1), Theorem 
2.2), and that the set (w,T) - Ui^w.T) : 0 < t < T) is an "integral 
manifold of the distribution #(A,B)" ([1], Theorem 2.1). The purpose 

of this note is to show that Lemma 2.8 of il], which is a fundamental step 
in the proof of Theorems 2.1 and 2.2, is false. The natural question to 
be asked now is whether these results are nevertheless valid; It will be 
shown in a forthcoming paper that they are. The proof, however, is based 
on a completely different technique. 

We quote the statement of Lemma 2.8 of [1] : 

"{Let] T > 0, 4 e (0, | )i u e M(d,l- 6 ). Let the function u 

be not constant in <0,T> (not equivalent with & constant function ) , then 

00 

r(x(T,u))C \J r.K (T). M 
*” r -1 

The notations of the above statement have the following meaning: 

a) " (a,$)" (resp. <<*,$>) is the open (resp. closed) interval with 
endpoints a, 8 * 

b) "M(a, 6 )" is the set of all measurable functions in (0,«>) whose 
values lie in <a, 8 >, a < 8 » 

c) t -*■ x(t,u) is the solution of the equation 

jdx(t) - (*(l-«(t» + Bu(t))x(t) 

which satisfies jc(0) - b). Here u is a fixed element of R- (n-dimens ional 

*This work was performed while the author was at the Division of Engrg. and 
Applied Physics, Harvard Univ., Cambridge, Mass. The author's current address 
' is: Dept, of Math., Univ. of Chicago, Chicago, Illinios 60637. 
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real space), and A, JS are fixed elements of (the set of all n by n real 
matrices) . 

d) '5^' is the "distribution created by#?", i.e. the mapping that 

H Bt 

assigns to each x E the set VXxl of all elements of gr of the form 
Px, P c J. 

e) *#?", or '&(A,B)" is "the smallest linear space of n by n matrices 
which contains the matrix C[»B-A] and, with each P c3B, contains also 
both matrices (A,P) and (B,P)" (flj, Def. 2.2; the notation "(M^NJ" 
means "NM-MN"). 

f) X^(T) is the set of all vectors x^T.y), y e M(-A,l) , where 

x. (T,v) - X(T)(f~ jf 1 (t)Ot(t)y(t)dt) , 

~ “ J Q 


and where 

g) t •* X(t) is the n by n matrix-valued solution of 


X(t) - (*(1-U(l)i>+ gu(t))X(t) 

which satisfies X(0> - I, (I. is the n by n identity matrix). 

Ke shalJ show that Lemma 2.8 is false by means of a counterexample. 
Consider the space of all real polynomials in two noncommuting variables 
y,z. Form the space#’ by equating to zero all the monomials of degree 5 
or more (in other words is the free algebra over the reals generated 
by y and z, and #*18 the quotient of by the ideal generated by all the 

monomials of degree 5). Thus, #* is a 31-dimensional real vector space, 

2 2 3 2 2 2 2 

and the monomials l,y,z, y • y*» £/» z , y , y z, yzy , zy , yz , zyz , z y, 

3‘ / ‘»2 2322 2 2 223 2 

* tj : z, y zy, yzy , zy , y z , yzyz, zy z, yz y, zyzy , _iy , yz , zyz , 
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2 3 4 

■5 Z5» * y» _* are a basis for 9*. Moreover, ^is an associative algebra 

over the reals, with the obvious multiplication table (for instance: 

3 


.31 


2? * i*Y • zyz y> rjr • jr " 0, etc.). 

By means of this basis we can (and shall) identify with 
In 3 a t the mappings p (£,*.•} **■ y2(x»z) and p(y,z) *► zp(y,z) are linear. 

Via the above mentioned identification, we obtain two 31 by 31 matrices 
and such that these mappings correspond to x -*■ M^x and _x -*■ M^x, 
respectively. We let A. - M^, B “ + M^, so that C ■» M^. To begin 

with, we compute the space 35. It is clear from the definition that 35 is 
the smallest linear space that contains C such that, f f P c35 t then [A,P] 
and [C,P] belong to .Pi. Thus 35 is the linear hull of the set <? of ail 
matrices l<^, ^2** ‘ * * ^-k-l*%^‘ * * ^ ^ * w ^ ere ^ Is an integer > 0, and where 
“ A or_Q^ ■ C for i - l,...,k-l, - r Using the facts that iC,C] - 0 

and that [A, [C, [A,C]]] » [C, [A, [A,C] ] ] (an immediate consequence of the 
equality [P,P] • 0 and of the Jacobi identity [P»l3»R]] * £ [_P ,Q ] ,R] a* 

[Q, rp,R)]) we see that the following are all the elements of 3B corresponding 
to k < 4: 


“ C, M 2 - [A, C] , M 3 - (A,(A,C]J, M a - [CjA.C]], 

H 5 - [A,(A,[A,cni, M 6 - tC,(A,(A,C]l) and M ? - [C, [C, [A,C)] ) . 

In addition, all the elements or & corresponding to k * 5 vanish. 

31 

This is so because, via our identification of ^ with & (and of the 
corresponding identification of with the set of endomorphisms of the 
vector space ^0 , every element Q - [C^, [Q 2 » • • • , 19^-1 * * • 1 ) of <? corresponds 
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to the multiplication in & by a homogeneous polynomial p(Q) of degree k 
(for instance, if Q * [A,C) , then p(Q) * zy-yz). Since every homogeneous 
polynomial of degree > 5 vanishes in 3. cur assertion follows. 

Thus SB is the linear hull of We show that these matrices 

are linearly Independent. It is sufficient to prove that the r explications 
by the corresponding polynomials p(M^) , . . . are linearly Independent. 

If these multiplications were not independent, then tie images of the poly- 
nomial 1 would be dependent, i.e. the polynomials £(M^) , . . . ,pQ{^) would be 
dependent. Thus, it is sufficient to show that these polynomials are 
independent. F ^ p(M^) , . . . are homogeneous polynomials both in y 

and z, end no two of them have the same degrees both in y and z. Hence 
they are Independent. 

We have shown that , . . . ,My form a basis for JE?, so that 38 I ra dimension 
seven . 

31 

We shall take m to be the element of £ which corresponds to the 
polynomial 1 of 3. Thus, T'Cw) is the linear hull of p(H^) , . . . , 

and dimY1[(o) a 7. We take 6 to b? an arbitrary element of (O,-^), and 
define the contrcl u_ by 

u(t) - 6 + t for 0 < U T , 

where T - 1 - 26. Thus all the assumptions of Lemma 2.8 of [1^ hold. 

We show that the dimension of 7^(x(T,u)) is also seven. This is an 
immediate consequence of Lemma 2.11 of [1], or it can be proved diiectly 
as follows: the equation 
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^ • (A + u(t)C)x(t,u) 

Implies ' .e derivative of the polynomial x(t,u) is a polynomial 

in vichout a constant term. This 'mplies that the constant term 
of x(t,u) i9 1 for all t_ (because x(0,u) • w ■ 1) . From this it follows 
1* mediately that the seven polynomials pQi^)x(t,u) are linearly independent. 

** OO 

We shrll show that the dimension of the subspace (T) is not 

„ r-l- sl ~ 

greater than six. To begin with,LJ_r. K (T) is obviously the set of all 

r«l- **1} 

elements of the form 

X(T)(f~ x' 1 (t)QC(t)v(t)dt)a) , 

'o 

where v is an arbitrary bounded measurable function in <0,T>. We see 

immediately that this is the same as the linear hull of 

JC(T)a \t)CX(t)u>, t c <0,T>. The dimension of L" is the same as that 

of the linear hull of all the elements X *(t)CX(t)ti> (because ,X(T) 

is nonsingular), tonally, this dimension is not greater than that of the 

linear hull t or all the matrices _X ^(t)CX(t) , _t e <0,T>. 

Thus, it sufficient to show that dim J, < 6. Since u(t) - t^+ 6, 

the function t *► X ^(t)CX(t) • y(t) is analytic. Thus , L is tne linear hull 

of the coefficients of the power series expansion of jr in a neighborhood 

aft I 

of t - 0 or, equivalently, L is the linear hull of {*— y(0| t- Q : n-0,1,...}. 

Since -A X(t) ■ (A+u(u)C)Xtt) , we see that 

'll 

X _1 (t) - -X~ 1 (A+u(t)C) . 

If M(t) is any matrix-valued function, we have 

^(x" 1 (t)M(c)X(t') - -x‘ 1 (t)(A+uft)C)M(t)X(t)+J!‘ 1 (t)MCs)(A+y(t)C)*Ct) 

+X* 1 (t)^M(t ) • X(t) . 
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App lying this formula to the successive derivatives of Y we obtain 

X<t> - x“ 1 (t)F.(t)X(t) , 
dt- - 

where 

^(t) 5 C, F x (t) = [A,C], F 2 (t) - [«•(£«)£, [A.Cll , 

Fj(t) - [A+(t+i)C, [A+(t+d)C, [A,C] ] ]+tC, [A.C] ] , 

F^(t) - [C, [A+(t+6)C, [A.C] ] ]+2[A+(t+i)C, [C, I*,C] ) ] , 

Ij(t) 5 3(C,[C, (A.C)]] and F fi (c) i 0. 

In the above computations we have used the fact that every element of 
& corresponding to jt * 5 vanishes. 

jS. I 

Our computations show that, of all the matrices — — Y(t)|. , 

4 t “ * ' "- 0 

only those for n«0,...,5 are nonzero. Thus , the dimension of L. is not 
greater than 6. 

The proceeding remarks show that, in our example, the dimension of 

00 

^(X(T,u)) is seven, while the dimension of r.K y (T) is less than six. 

Therefore, the conclusion of Lemma 2.8 of [1] does not hold, even though all 
the assumptions are satisfied. Thus, Lemma 2.8 of [1] is false. 

Reference 

[1] J. RuJ*era : Solution in large of Control Problem^ ■ (A(l-u)+Bji)x. 
Czech. Math. J. 16(91), 1966, 600-623. 



THE BANC -BANG PROBLEM FOR CERTAIN CONTROL SYSTEMS 


IN GL(n,R) 

H. T. Sussmann * 


ABSTRACT 


m 

We discuss the linear control problem X(t/ = [A n (t)+ 2 uJt)A.(tj] 

u i-1 ^ - 

X(t). where A., . . . ,A are nxn matrix-valued functions of time, and 
— - ’ — 0 7 m — ~ 

where X(i) € We show that the set attainable from any element 

M e GL(n,R) at time t_ by "bang-bang M controls is closed, provided 
tne following very strong assumption is satisfied: for ali i, j and for 
all t^, _t" such that 0 ^ t ^ t, 0 ^ t" < t, the matrices A.(tJ) and 

A.(LL') commute. We also show, by means of counterexamples, that 

J 

these assumptions cannot be weakened. 


*This work was performed while the author was at the Division of 
Engineering and Applied Physics, TT arvard University, Cambridge, 
Mass. It was supported by the U.* S. Office of Naval Research under 
the Joint Services Electronics Program by Contract NQ00I4-67-A- 
0298 - 0006 . 

• -.-Department of Mathematics, University of Chicago, Chicago, 111. 
60637. 
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1. Introduction 

Recently, interest has arisen in the study of the linear control 
problem in manifolds and Lie groups (Brocket* [l], Haynes and Hermes [4], 
Kuifera [5] and [6]). Such a control problem is of the form 

m 

x(t) = Xq (*(*)) + 2 u.(t) X. (x(t)) , 

where x(t) denotes the tangent vector to the curve t "♦ x(t) at t = t, 
and where X„. .... X are vector fields. 

A particularly important case is that in which the manifold is a 
Lie group, and the vector fields are translation-invariant. When the 
Lie group is GL (n, R), the problem takes the simple form 

IS 

X(t) = 'A o +2 u. (t'A.) X(t) , 

— u . , -i — 1 

i-l 

where A_. . * . , A are constant matrices. 

— Cr 7 — m 

The purpose of this article if to indicate what hopes there should be 
of building a reasonable "bang-bang n theory for this problem, and for the 
more general one in which the matrices A^ are time-dependent. The 
important issue is, as usual, to determine whether the attainable set at 
time T^ is closed. It might seem likely that the tool to be used should 
be some generalization of the well known theorem of Lyapounov, which 
has proved so fruitful for similar problems (Lyapounov [?], Halkin [2] and 
[3]). However, as we shall show, not much is to be expected in this 
direction. We shall prove, that, under certain very restrictive conditions, 
c. sed.uess of the attainable set follows by a straightforward application 
of Lyapounov's theorem. We shall also show that, if these conditions are 
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weakened, it is possible to give examples of control systems for which 
the attainable set fails to be closed. 

The new aspect that plays a fundamental role is that of the 
commutativity of the matrices A^. The very restrictive assumptions to 
which we have referred are the following: 

a) The condition that [A.(0, A.(t*)] ^ should vanish for all i, j, t, 
t ' . This will guarantee that the set attainable at time T by "bang-bang" 
controls is closed, provided we define a "bang -bang" control as a 
measurable function with values in the set {-1, l} . 

b) The additional condition that the functions A^(t) should be piecewise 
analytic. If this is true, we will be able to get closedness even if we 
restrict the class of "bang-bang" controls to piecewise constant functions 
with values in {-1, l} . Of course, this covers the time -independent case 

in particular. 

The main point of this paper is that these conditions cannot be 
weakened. This will be shown by giving three examples of non-closed 
attainable sets. These examples cover, in our opinion, the simplest 
possible conceivable departures from the commutativity condition. Thus, 
our results constitute a rather final answer to the closedness problem. 

Our results also apply to systems in which X is a column vector in 
R~, rather than an nxn matrix (cf. Remark 1 of section 4). 
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2 . Notations and Preliminary Lemmas 

We shall consider the control problem 

m 

X(t) = (A 0 (t) + E u.(t) A.(t» X(t) , (1) 

2=1 — ~ 

where X(t) belongs to GL (n, R) (the set of all nonsingular real nxn 
matrices). The functions A.(t) are supposed to be bounded and 
measurable, with values in the set M(n,R) of all nxn real matrices. 
For AeM(n,R), define the norm of A (denoted by |]Aj) as the supremum 
of jAx||, where x ranges through all the vectors in R~ such that 



We shall denote by U(T), for each T? > 0, the set of all 

( 2 ) 

measurable' 1 functions defined in the closed interval [0, T] with values 
in the cube { (u. , . . . , u ) : -1 ^ u. ^ 1, i = 1, . . . , m} . We shall denote 
by UB (T) the sublet of U(T) whose elements are the "bang-bang" 
functions, i. e. the measurable functions (u^(t), . . . ,u m (t)) such that 
u^(t) = 1 or u^(t) - -1 for all i = 1, . . . ,m and all 0 ^ t ^ T. Finally, the 
set of all ueUB(T) that are piecewise constant will ue denoted by UBP (T). 

It is clear that U(T) is a bounded and weakly Closed subset of 
i- 2 [0,t], so that U(T) is weakly compact. We also have: 
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LEMMA 1. UBP(T) is weakly dense in U(T). 

Proof: It is clearly sufficient to assume rn = 1. Since every function 
in U(T) can be approximated in the L^-norm by piecewise constant 
functions, it follows that it will be sufficient to show that every constant 
function is a weak limit of elements of UBP (T). 

Let u(t) = r ^ 1, for 0 ^ t ^ X- We can assume r ^ 0. For each 
interval 1^ = [a, b], let the function be defined as follows: 

fj(tJ = -1 for a ^ t ^ a + \ (l-r)(b-a) 

f_(t) = 1 for a + j (l-r)(b-a) < t ^ b. 

f b 

Then, clearly, \ >L(t)*^-~-r-{b-a). Now define u. (for k = 1, 2,...) 

J a. i ~£- 

by partitioning the interval [0, T] into k intervals 1^, .... i,^ of 
length Tk and letting u^Ot) = (1) ^ or each t_e 

now obvious that the functions u^ belong to UBP (T) and that their weak limit 
is u. The pi oof of our lemma is thus complete. 

Let ueU(T) Let X( u > * ) be the solution of equation ()) which 
satisfies the initial condition X(0) = I (1^ = nxn identity matrix). The set 
of all matrices X(u,T), for ueU(T), is the attainable set at time T, and we 
shall denote it by S(T). If we restrict ourselves to functions ueUB(T), 
i~esp. ucUBPlT)), we can similarly define the sets SB (T), SBP(T). 

The union of nie sets S(t) for all 0 ^ _t ^ T will be denoted by 
In a similar way, we deline the sets SB '(T), SBP'(T). 

It is clear that no loss of generality is involved in limiting ourselves 
to the study of the sets attainable from the identity. Indeed, the set of 
matrices attainable from any other M e G L (n. R) is just the set of all 


f X “ • • • > -iL 


It is 
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products X(u, t)M. 


LEMMA 2. Let the functions converge weakly to _u . 
{X(u. , t)} converges uniformly to X(u, t) for O^t^T. 

Proof : For each veU(T), we have 

m 

X.(v, t) = I_+ \ [A (t) + 2 v.(t) A (t)] X(v, t) dr 
J 0 U i=l - - “ 


Then 


( 2 ) 


Since the functions A^ are bounded, and |v^(r)| ^ 1, there is a 
constant C > 0 such that 

||X(Y,1) II ^ 1 + £ f- ||X(V, T) II dr 

J 0 

for all e U(T), and all 0 ^ _t ^ T. 

It follows by a well-known argument that 


|| X(v, t) || ^ exp (Ct), for all v, t. 


In particular, we see that the functions X(Y> * HveU(T)) are uniformly 
bounded. Equation (1) then implies that the derivatives of these functions 
are also uniformly bounded. 

To show that ) converges uniformly to 2^ u > ' )> i* * s sufficient 

to show that every subsequence has a subsequence that converges uniformly 
to X(u, ' )■ By the previous paragraph and the Ascoli-Arzelk theorem, 
every subsequence has a subsequence that converges uniformly to some 
function Thus, our lemma will be proved if we show that, if {v } converges 
weakly to v, and if X(y, t * ) converges uniformly to X(* ), then 
X( ) 5 X<2.->- 
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Equation (2) implies that 


X(v_ k , t) - I + 



- X(t)] dr 


IS 


m 


(r) + 2 (x k ) i ( T )^ i ( T )] X( 7 )^ T 

i-1 


Using the weak convergence of to v, and the uniform convergence 
of X(v^, ) to >C(' ), it follows that 

pt no 

X(t) = 1 + \ [A (t) + 2 V (t) A (t)] X(t) dr . 

J o u i=i 1 - 

Then, X't) = and our lemma is proved. 

COROLLARY 1. The mapping u X(u, • ) hs continuous from 
U(T) with the weak topology into the space of continuous M(n, R) - valued 
functions in [0, T] with the uniform topology 

30R0LLARY 2. The sets S(T), S'(T) are compact . 

COROLLARY 3. The sets SBP(T), SBP’(T) are dense in S(T). 
S'(T), respectively. 

Proof . Corollary 1 is a restatement of Lemma 2. Cor. 2 follows 
from Cor. 1 and the iact that U(T) is weakly compact , 

Finally, Cor. 3 follows from Lemma 1 and Cor. 1. 


3. Clo^e. ness of the "Bang-Bang "-Attainable Set 

It is clear from the preceding section that closedness of the attainable 
set SB(T) (resp. SBP (T), SB '(T), SBP'(T)) is equivalent to the identity 
J5(T) = SB(T) (resp. S(T) = SBP(T), S_'(T) = SB '(T), S.'(T) = SBP'(T)). 
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Tho following theorem is a positive result in this direction and, as we 

shall prove in the next section, it is the best possible result of that type. 

THEOREM 1. If all the brackets [A ^(t) , f) ] v anish ( for all i, j_, 

t, V) , then SB(T) and SB'(T) are closed . If, in addition , the functions A 

are piecewise analytic , then the sets SBP(T ) and SBP '(T) are a Iso closed . 

Proof . If our assumption about the brackets holds, the solution of 

equation (1) is given by ^ m ^ 

X(u,t) = exp(f - _^ (T)dT). ff exp ( f ~ A (T)u . (r)dT). 

— Jo° J 0 i i 

1 =1 


To verify this, notice that:(i) the derivative of exp(F(t)) is F'(t) exp (F(i)), 
if F is a matrix- valued function such that F(t^) and commute for all 

t_ j , t _£ , a nd that: ( .) exp (_M + N) = exp( M) • exp (N) if M andhJ^are two com - 
muting matrices (these two facts are proved, using the power series expan- 
sion for the exponential, in exactly the same way as for the scalar case; the 
commutativity makes it possible^to "rearrange" factors). From (i) it fol- 
lows easily that X(u,^) = exp ( J (A^ + T, _u.A.)). The desired expres- 
sion then results from (ii). 

It follows from Lyapounov's theorem on the range of a vector-valued 

r 1 . 

measure (Lyapounov [7], Halkin [2.],, that the set of matrices I A.(T)\i(T)dr, 

J 0 - ~ 

where u ranges over the set of all measurable functions with values in {-1, 1 }, 
is compact for each i_. Thus, the first part of our statement is clear. The 
second part follows in a similar way: according to a theorem of Halkin 

r — 

(see[2] and [3]), the set of values / f_(T^u(T)<iT, where u^ ranges over all 

J 0 

piecewise constant [-1, l]-valued functions in [0,^T ], and where _f is a 
vector- valued piecewise analytic function, is compact. 

4. Counterexamples 

We jW show that the assumptions of Theorem 1 cannot be weakened. 
Clearly, the simplest situations in which these assumptions do not hold are 
a) the control problem 

2 ( 1 ) = (B 4 .ii)C) X(t), 


(3) 
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where B and C are constant matrices such that BC / CB . 
b) the problem 


X(t) = (u(t)B f v(t)C) X(t) 


(4) 


where B and are as in a), and 
c) the problem 


X(t) = v(t) F(t) X(t), (5) 

where Fisa matrix~valued function such that [F(t>. £('■)] / 0 for some 
t, t\ 

THEOREM 2. In each of the cases a), Ja), c), the set SB(T) need 
not be closed . 

P roof . We shall exhibit examples of problems where SB (T) fails 
to be closed. Our three examples will involve 4x4 matrices, i. e , we 
shall be working in GL (4, R) . The examples for cases b) and c) will be 
derived from the example for case a). 

We let 


“o 

0 

0 
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9 
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Let X(y,f. ) be the solution of equation (3) whose value at = 0 is 
the identity matrix. 

L is possible to compute .>■ t) explicitly. The result is 



x(y,D = 




If SB(T) were closed for some T > 0, it would fo.'ow from Cor. 3 
that 3B(T) = ^(I_). In particular, the matrix ^ would belong to SB (T). 

Thus, there would exist a "bang -bang” contro^ v for which 

pi pi pi rr 

0 = v v(r) d T - \ Tv(r)dT = \ v(t) ^ ^x(Mii*-dT. (6) 

Jq vq _ J 0 ‘ *" 0 

We shall show that this is impossible. Indeed, by repeated 
integrations by parts we get: 



C X v(X) dX dT 
J f) “ 


rT 

\ f'(r) h(r) dT 

J 0 " 


T 

= f(T) h(T) - C“f( T ) h'(T)i 
J 0 ~ 


= *(!• Ml) - f 1(t) 2'(t)t di 
j o 
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T 2 

= f(I)h(T) - f - (tI t dr 

T 

= f(T)h(T) - 1 T fV) + 1 \ _ U< T )] 2 i T 

J f) 


r± 2 

If a control v satisfies (6), it follows that \ [f(r)] dy = 0. This 

J 0 

implies that f(T) = 0 almost everywhere. Since v is the derivative of f 
a. e. , we must have that v = 0 a. e. , so that v cannot be "bang -bang”. This 
completes the proof that SB (T) is not closed for any T > 0. 

Turning now to case b), we shall use the same matrices 15 and C_ as 
before If _X(u, v, • ) denotes the solution of equation (4) whose value at 
t = 0 is I, it is clear that XU>Y> ' ) * s the function X(v, • ) of the previous 
paragraph. We claim that X(l, 0, T) cannot be attained by "bang-bang" 
controls u, v, In view of what we have proved above, it is sufficient to 
show that, if X(u,v, T) = }£(1, 0,TT), then u = 1. But this can be seen easily 

fl 

as follows: we can compute X( U >Y>.Z) explicitly and obtain \ u(t) dT as 

J 0 

the value of the entry in the third row, fourth column. Since this entry has 
to be equal to TT for ,X^>®>Z)» an< * si nce H(t) ^ 1 for all r , u must be 
1 almost everywhere. 

Finally, we consider case c). Here, we define 


F(t) = exp( -Bt) . C . exp( Bt) . 


where B and C are the same matrices that have been used for the other 
two cases. Let Y(v, • ) be the solution of equation (5) whose value at t = 0 
is I. It is seen immediately that 


Y(y,t) = exp(-Bt) . X(v,t) . 
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Since we know that there does not exist a ’’bang-bang" control v 
such that X(v, T) = ** follows that Y(0,T) is not attainable at 

time JT by "bang-bang" controls. Our proof is thus complete. 

Remarks : 

1) It is clear that all our results are equally valid for control problems 
of the type 


m 

= [A (t) + 2 u (t) AJt)] x(t) , (7) 

i=l 1 ^ 

where x(t) is a column vector in R— and A rt . . . . ,A are matrix-valued 
(3) 

functions' . This is obvious for all our positive results, because the 
solution of equation (7) with initial condition x(0) = Xq is just X( u, * ) 3 Cq. 

As for Theorem 2, we need only observe that the problem considered in case 
a) is equivalent to the "vector" problem 

x(t) = (B + v(t) C) x(t) , 

where x(t) is a 4x4 matrix, considered as a vector in 16-dimensional space, 

and where B and ^ are suitable 16x16 matrices (the matrices of the 

linear transformations x -» Bx and Cx , respectively, with respect 

to an appropriate basis of R^). The sets attainable from the "vector" 

Xq = I. will coincide with the sets S(T), SB(T), and in this way we get a 

counterexample for case a) of the "vector" problem Obviously, similar 

considerations apply to the other cases 

2) For completeness, we should give an example of a situation in 

which all the commutativity assumptions of Theorem 1 hold, but SBP (T) 

fails to be closed because of non-analyticity of the functions A. . It is Weil- 
ls T ~- 

known (and easy to prove) that the set of numbers \ f(t) u(t) dt , where 

J 0 
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f(t) = sin(l/t), and where u ranges over all piecewise constant 

{■1, l} -valued functions in [0, T], is not closed for any T > 0. Let this 

be denoted by A^.. The set B^. = {e":xeA} is therefore not closed. 

But B,j. is the set of points attainable from jt = 1 at time T by n bang- 
bang" controls, for the system 

x(t) = f(t) u(t) x(t) 

(which is of the form that we are considering, with n = 1). Moreover, by 
multiplying the function £ by a smooth function that vanishes at the 
origin to a sufficiently high order, we can modify our counterexample 
so that will be as smooth as desired and even C°°). 
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Footnotes 


1 . We are using here the standard notation [M, £J] = MN-NM. Thus 
”IM,n] = o“ is another way of saying that the matrices tol and N. 
commute. 

2. We follow the standard convention of identifying functions that are 
equal almost everywhere. 

3. This is the problem studied by Kucera in [5] and [&}. 
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CONTROLLABILITY OF NON-LINEAR SYSTEM* 

Hector J. Sussman** and Velimir Jurdjevic*** 

1. Introduction 

In this article we study the controllability of non-linear systems 
of the form 

-^-=F(x,u). (*) 

Our objective is to establish criteria in terms of F and its de- 
rivatives at a point x which will give qualitative information about the 
sets attainable from x. The study is based primarily on the work of 
Chow [4] and Lobry [16], although it is similar in its approach to works 
by other authors in that it makes systematic use of differential geometry 
(for instance, see Hermann [8], [9], Haynes & Hermes [6], Brockett 
[2], etc.). 

The state variable x is assumed to take values in an arbitrary 
real analytic manifold M, rather than in R n . We chose this generalization 
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because it creates no essential new difficulties while, on the other hand, 
it allows for certain applications which are not commonly treated in 
control theory. For instance, when M is a Lie group, then the present 
results can be specialized to obtain more detailed controllability 
criteria. Control problems on Lie groups were first considered by 
R. W. Brockett in [2], and will be treated in a forthcoming paper by the 
authors. 

Most of the recent studies on controllability of non-linear systems 
have essentially dealt with symmetric systems, i. e. , systems of the 
form (*) with the property that F(x, -u) = -F(x, u) (Hermann [9], Haynes 
and Hermes [6], Lobry [16]). As remarked by Lobry in [16], the con- 
sideration of symmetric systems often excludes interesting situations 
arising from mechanics. In these cases the system is of the form 


dx 

dt 


= A(x) + H(x)* u. 


A notable exception is the work by Lobry [17]. Lobry stated 

3 

(and proved for the case of two vector fields in R ) the result for non- 
symmetric systems that appears here as Theorem 3. 1. 

Our results apply to non- symmetric systems. We obtain some 
general information about the geometric structure of the attainable 
sets showing that they "practically" are submanifolds (see Theorems 
4.4 and 4. 5 for the precise statements). This information yields a 
complete answer to the problem of deciding when the sets attainable 
from a poi: x have a non-empty interior. The criteria obtained 

involve purely algebraic manipulations of F_ and its derivatives (of all 


orders) at the point x (see the Remark below). 
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In particu'nr, our results contain those of Kucera [14]. In this 
connection we observe that our proofs are of interest even for the case 
treated by Kucera (see Sussman [21]). 

We have emitted the consideration of non-autonomous systems; 
they can be treated analogously by the familiar procedure of reduction 
to an autonomous system (i. e. , by considering the state variable to be 
defined in M_ X ,R). 

The organization of the article is as follows: in section 2 we intro- 

duce notations and basic concepts; in addition, we quote some well-known 
basic results which will be used later. In section 3 we prove our main 
results in differential geometric terminology. 

In section 4, we apply these results to control systems. We 
derive the algebraic criteria mentioned above (Corollaries 4. 6 and 4. 7) 
and we prove two "global results: we show that, for a large class of 

manifolds, accessibility (i. e. the property that, for any given x, the 
set of points attainable from x has a nonempty interior) implies strong 
accessibility (i. e. that, for any given x and any given fixed positive_t, 
the set of points attainable from x at time t_has a nonempty interior). We 
also show that, for a still larger class, including the Euclidean spaces, 
controllability implies strong accessibility. 

Finally, section 5 contains examples. We show how our results 
can be used to derive the classical controllability criteria for the 



system 
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We also derive the results of Kucera and indicate some generalizations. 

R emark . An assumption that is made throughout the article is 
that F is an analytic function of jc. This guarantees that all the inform- 
ation about the system is actually contained in F and its derivatives (of 
all orders) at a given point x. The analyticity assumption cannot be 
relaxed without destroying the theory (cf. Err triple 5. 3). 

Another assumption that we make is tha { the trajectories of the 
system are everywhere efined. As opposed to the previous one, this 
assumption is not essential (except for the "global" Theorems 4, 9 and 
4. 10). We use it, however, because it considerably simplifies all the 
proofs. 

2. Preliminaries 

We shall assume that the reader is familiar with the fundamental 
notions of differential geometry. All the definitions and basic concepts 
utilized in this paper can be found in standard books, (for instance, 

[1], [3], [7], (13] and [19]). 

The following notations will be used throughout: 

R--the set of real numbers. 

R n --n - dimensional Euclidean space. 

M -- the tangent space to the manifold M at the point x. 

TM - - the tangent bundle of the manifold M. 

V(M) - - the set of all analytic vector fields on the analytic 
manifold M. 

We will regard V(M) as a Lie algebra over the reals. 

For any X. and _Y in V(M), we will denote the Lie product by 
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[X,_Yj(i. e. , [X, Y ] = XY - YX ) . All the manifolds will be assumed to 
be paracompact. Recall that a submanifold of a paracompact manifold 
is paracompact. Also, a connected paracompact manifold is a countable 
union of compact sets. These facts imply (cf. Lobry fl6], p 589): 

LEMMA 2. 1. Let M be a (paracompact) manifold of dimension n. 
Let S be a k-dimensional connected submanifold of -M. If jc< _n, then 
the set of points of S has an empty interior in _M. 

A subset JD of V( M) will be called involutive if, whenever and _Y_ 
belong to D, then [X,Y] also belongs to D. A subalgebra of V(M) is an 
involutive subspace. Let D C V(M). An integral manifold of D is a 
connected submanifold S of M with the property that = ^(D(x)) for 
every x e S, where D(x) = {X (x) : X D}, and where <£ (D(x)) is the 
subspace of M spanned by D(x). We state the following basic results 

X 

about integral manifolds: 

LEMMA 2. 2. Let E) be an involutive subset of V(M), and let 
x e_M. Then x is contained in a unique maximal integral manifold of 
JD (here "maximal" means "maximal with respect to inclusion"). 

This result is classical if the dimension of^(D(x)) is the same for 
each x e M(Chevalley [5]). For a proof in the general case, see Lobry 
[ 16 ]. 

If D C V(M), we denote the smallest subalgebra which 

contains D by^y (D) , and the maximal integral manifold o': ">) through 

x by I(D,x). Recall that, if X is a vector field on Kl, then .... an 
integral curve of X if a is a smooth mapping from a closed interval 
ICR) into M such that 
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• = X(a(t)) for all t el. 
at — — 

DEFINITION 2. 3. If D is a subset of V(M), then an integral 
curve of D is a mapping afrom a real interval [t, t ’ ] into ^ such that 
there exist t = t^ < :^ < . . . < t^ - t , and elements X ^ , . . , , of D 
with the property that the restriction of a to [t. j,t.] is an integral 
curve of X. for each i = 1, 2, . . . , k. We ha^e the following elementary 
fact: 

LEMMA 2.4. LetJD£2 VJM). Let a: [tg,tj] an integral 

curve of D, and let a(t) = x for some t e [t^, t ^ Then a (s) el(13,x) 
for all s e [f^, tj]. 

Proof . It is sufficient to consider the case when a is an integral 
curve of X, _X e For each maximal integral manifold S of^“(D). 
lot J(S) be the set of all s e ft^ , t^J such that a(s) e S. From the local 
existence and uniqueness of solutions of ordinary differential equations 
it follows that, if s eJ(S), then there exists £>0 such that ( s-r, s+r) 
[tg,tj] Cl J(S). Thus, J(S) is open relative to f , t ^ Since the 
maximal integral manifolds of (D) are disjoint, we hr.ve that, *or some 
maximal integral manifold S, [t^, tj] d J(S). But a(t) eI(D,x); therefore, 
our proof is complete. 

Chow's theorem provides a partial converse to the above lemma. 

If D CV(M), then ^ is symmetric if, whenever X e D, -X also belongs 
to £). We can now state Chow's theorem as follows: 

LEMMA 2. 5. Let D C V(M) be symmetric, and let x e M. Then, 
for every y e I(D,x) there exists an integral curve a: [C,T] -*• M of D, 
with T 2 0, such that a(0) = x and a(T) = y. 
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In other words, every point of the maximal integral manifold of 
,1 (_D) through an be reached in positive time by following an integral 
curve of D having x as its initial point. 

DEFINITION 2. 6. Let D CL V(M), and let x e _M. If T_> 0, then, 
for any ^ e M_, * s D- reachable from x at time T if there exists an in- 
tegral curve a of D defined on [0,T] such that a(0) = x and a(T) = y. The 
set of all D- reachable points from x at time T is denoted by L (D,T). 

The union of L (D,t) for 0 ~t< » (respectively for 0 ^ t ^ T) is denoted 

X 

by L (D) (respectively L JD,T)). 

~ X ^ X 

3. Integrability of Families of Analytic Vector Fields 

As an introduction to the general situation, we first considered the 
case when D is a symmetric subset of V(M). Chow’s theorem can be 
utilized to obtain a necessary and sufficient condition for L (D) to have 

X 

a non-empty interior in M . Let n = dim M = dim ^(D) (x). Then 
, x) is an n-dimensional submanifold of A4, and hence is open in M. 

By Chow's theorem we have that L (D) = I(D,x). We conclude that L (D) 

^ X ~ X 

is )pen in M. Conversely (and without invoking the symmetry of IDJ if 
dim«y(D) (x)<ji , then _I(D),x) is a connected submanifold of M of dimension 
less than n; then from Lemma 2. 1 it follows directly that I(D,x) has an 
empty interior in M. Since L (D)CI(D,x), L (D) also has an empty interior. 
Thus, if D is symmetric, a necessary and sufficient condition for L (D) 
to have a non-empty interior in ^ is that dim ^"(D)(x) = dim M. Moreover, 
this condition is necessary even in the non-symmetric case (Lobry [ 16]). 

We shall show that it is also sufficient. For this purpose we shall 
assume that the elements of 13 are complete--recall that a vectt r field 
2£ is complete the integral curves of X. are defined for all real £ 

(cf. [131, p. 13). 
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THEOREM 3. 1. Let M be an n - dimensional analytic manifold, 
and letD£I.V(M) a family of complete vector fields, A necessary 
and sufficient condition for L (D) to have a non-empty interior in M is 
that dim.y'fpHx) = n.. Moreover, if this condition is satisfied, then for 
each T > 0, the interior of L (D, T) is dense in L (D, T) (thus, in 

— — ~ X ~ X 

particular, L (D,T) has a non-empty interior). 

Proof . We already know that the condition of the theorem is 

necessary. So we assume that dim</(D)(x) = n, and we prove the 

second statement. Clearly, this will imply that L^D) has a non-empty 

interior in M . Without loss of generality we can assume that D is finite. 

Let D = {Xj, . . . , X^}. For each i = 1, 2, . . . , k, let 4 .(t, • ) be the one- 

parameter group of diffeomophisms induced by X.(i. e. , t -►4.(t, y) is 

the integral curve of X. which passes through at p =0; the fact that 

it is defined for all realp follows from the completeness of X.). If m 

is a natural number t = (t, , . . . , t ) is an element of R m , and i = 

~ 1 m ~ ~ 

(i , . . . , i^) is an m- tuple of natural numbers between 1 and p, then we 

denote the element 4. (t.,$. (t , x). . . )) by 4 . (t, x). Let +D 

i, i lo c i m x ^ — — 

i c m ^ 

be the family of vector fields obtained from D by adjoining the vector 

fields -Xj, . . . , -X^ to p. Then, + D is symmetric, and dim^+D) (x) = n. 

From Chow's theorem we conclude that L (+ D) is open in M. Clearly, 

~ x — ‘ ^ ' 

the elements of L (+D) are exactly those elements of M which are of 

ew X ™" ' 

the form 4. (t,x) for some m, some m- tuple i, and some t e R m . For 

X rs-t " — ^ 

each i , and for each natural number N •> 0, let A (i , N) be the set of 
ail points of pp of the form 4 . (t , x) f where ||t || (here, (jtj| = |t | 

+ . . . + J t |). Since A_(i , N) is the image of the compact set 

{t : H t |l £ N] under the continuous mapping t -* 4 . (t,x), we have 
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that A ( i , N) is compact. Also, since L ( + D) is the union of the sets 
A(i,N) (taken over m, i and N), it follows from the category theorem that, 
for some i and N, the set A(i,N) has a non-empty interior in M. For 


such an i , let F;R 


M be defined by F(t) = ? . ( t, x). Then F is an 

~ 1 — • — 


analytic mapping whose image has a non-empty interior in _M. By Sard's 
theorem (Sternberg [ 1 9 j) , the differential dF^ of JF at t must have rank 
n for some t e R . Since dF depends analytically on t , it follows 

ft m ^ 

that the set Q = ft : t R , rank dF < n} has an empty interior. 

m $ m 

Let Q = R - . Then Q is open and dense in R 


Let T > 0, and let y c L (D,T). We now show that y is in the 
closure of the interior of L (D,T). It is clearly sufficient to assume 

X 

that ^ eL (D,t), where 0 5 t < T (for each point of L (D,T) is in the 

closure of { L (D,t) : 0 £ t < T }). Let y = $ . ( s , x) where j = (j j ) 

X J ~ * P 

s=(s 1 s ), s > 0, . . . , s >0, and s. +. . . + s = t. Let U = Q f) 

~ l pi p i p — 

{t:!!tl|<T-t}n{t:t 1 >0,...,t >0}. Uis open, and its closure 

contains the original 0 of R m . Since dF, has rank n at each point t s U, 

^ t ~ 

it follows that F(U) is open. Let V = [5.(s,F(t)) : t e Ul V is the 

*■— — 1 L J ^ — 

image of^(U) under the diffeomorphism _z -► $. (s _zl; therefore, _V is 
open in ^ and, moreover, every element of _V is ^-reachable from x 
at time ||^sjj + ll^ll + il * || < T (here we use essentially the fact 
that tj,.. . , t are non- negative). It remains to be shown that belongs 
to the closure of V. Let ft } be a sequence of elements of U which 

~ ~ q — 

converges to 0 . Then 

lim 5 . (s , F(t )) = J . (s, F(0)) = f . (s, x) = y. 
j ~ ~ j a- ** j ~ 

This completes the proof of the theorem. 



-70- 


We now wait to state an analogous theorem for the sets L (D,T). 

3C 

For this purpose, we shall introduce a Lie subalgebra ^(D) of .^"(D) 
which will be related to these sets in the same way as^(D) is related to 
the sets L (D,T). The aim of the following informal remarks is to 


motivate our definition of i^(D). We shall ignore the fact that time has 
to be positive. Moreover, we shall assume, for simplicity, that D 
consists of nree vector fields X^,X 7 and Xy Let 4 and $ ^ be the 

corresponding one-parameter groups. It is clear that#(D) has the 
fallowing 'geometric interpretation": #"(D)(x) is, for each x e M, the 
set of all limiting directions of curves through x that are entirely t n- 


tained ir L (D). Thus, for instance, if i-1,2, 3, then all the points in 

^ X 

the curve t -*> 4 .(t, x) are attainable from x (recall that we are forgetting 
about positivity), and this is reflected in the fact that X.(x) belongs to 


</(D)(x). Similarly, the curves a j( t) = 4 .(-t, $ ^(-t, 4 .(t, 4 ^(t, x))))are also 
contained in L (D). By the well known geometric interpretation of the 
Lie bracket (cf. Helgason [7], p. 97), the limiting direction of a.^ 
is [X.,Xj](x) (after a repr.rametrization). Thus, it is clear why [X.,X^] 
belongs to«y(D). Obviously, a similar argument works for the brackets 


of higher order. The geometrical meaning oi&\D) is now obvious. 

If ^j(D) is going to play the desired role it is clear that ^q(D)(x) 
will have to be the set of all limiting directions of curves y through x 
such that Y (t) is "attainable from x in zero units of time" for all t. 


Notice that the curves c ..(t) of the preceding paragraph have this 
property. Indeed, a/ 1) ca.? be reached from x by "moving forward" in 
time 2t units, and then "backward" another 2t units. This shows that 
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the vector fields [X. , X^] are reasonable candidates for membership in 

jTg(D). A similar argument applies to higher order brackets, such as 

fX., [X., X. 1], etc. On the other hand, a vector field such as X. should 
1 j k i 

not be included in,jT^(D) by definition, because we do not know whether 

the points $ .(t.x), t i 0, can be reached from x in 0 units of time (but, 

of course, it may happen that some X. will belong to d^(D) anyhow; 

for instance, we could have X^ = [X^.X^]). However, the vector fields 

X. - X. will have to be included, because (X.-X.)(x) is the limiting 
i j i y 

direction of the curve t -»• $ j(-t, I .(t, x)). In other words, the subspace 
generated by the differences X. - X^ will have to be included in .y^D). 
This subspace can also be defined as the set of all linear combinations 
X jX j + X 2 X 2 + X. ^X^ such that \ j + X 2 + X 3 = ® (that all the differences 
X. - Xj are linear combinations of this type is trivial; conversely, if 
Y = X jXj+ \ 2 X 2 -X 3 X 3 with X 1 +\ 2 +X 3 = 0, then Y = XjXj + X 2 X 2 + 
(- Xj - X 2 ) X 3 , i. e. , Y = X 1 (^ 1 -X 3 ) + X 2 (X 2 - X 3 )). 


We conclude that the reasonable candidates for membership in 
«£*(D) are: (i) all the brackets [X.,X.], [X.,[X.,X ]], etc. , and (ii) all 

U X J X J K 

the sums XjXj + X 2 X 2 + ^3 X 3’ w ' iere 2 X. = 0. Notice that the subset 
generated by (i) is clearly the derived algebra of ^(D) (by definition, 
the derived algebra of a Lie algebra L is the subalgebra of L generated 
by all the brackets [X, Y], X e L, Y e L; it is easy to check that L 1 
is in fact an ideal of L; cf. Heigason [7], p. 133. 

t 

We now return to o r formal development. Let^(D) denote the 
derived algebra of«/(D). Motivated by the previous remarks, we define 
,/q(D) to be the set of all sums X + Y, where X is a linear combination 
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P 

Z X.X. with X,,. .. ,X eDand^X. =0, 
i=i 1 1 1 p 

and where Y e^'(D). It is obvious that ^q(D) is an ideal of ^F(D). 

One shows easily that^(D) is the set of all vector fields of the form 

P 

2 X.X. + Y 
i=l 1 1 

where X. X belong to D, Y belongs to '(D), andX.,... ,X are 

P P 

reals (but X^ + . . . + X^ need not be zero). From this it follows im- 
mediately that ^ 0 (D) is a subspace of ^"(D) of codimension zero or one. The 
codimension will be zero if and only if some-^e D belongs to «^q(D) (in 
which case every X e D will belong to «^j(D)). Similarly for each x g M, 
if k = dim^(D) (x), then the dimension of^Q(D) (x) will either be k or 
k- 1. 

We shall also be interested in associating to each D CZ V(M), a 
set D* of vector fields in the manifold MXR. Recall that the tangent 
space to MXR at a point (x, r) (x g M, r cR) is identified, in a natural 
way, to the direct sum M © R . If x eV(M), Y e V(R) , we define the 
vector field X 0Y e V(MXR ) *v 

(X©Y) (x,r) =(X(x), Y(r)). 

The set D* is defined to be the set of all vector fields X©*^*, where 

X € D, and where is the "canonical" vector field on R (( *~f) (r) = 

o t ^ o t 

^ (r)). Using the identity [X©X\ Y©Y' ] = [X,Y] © [X',Y'], one 

shows easily that ^'(D 5 ^ is the set of all vector fields of the form X © 0 , 
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where X p,p(D) and 0 is the zero vector field. Therefore, J~(D*) is the 
set of vector fields of the form 

£ X (X © -A-) + Y©0 (*) 

i = 1 

where X, X belong to D, Y c (D), and X \ are scalars. 

1 p 1 P 

THEOREM 3.2. Let M be an analytic n-dimensional manifold, and 

let D be a family of complete analytic vector fields on M. Let x e M. and 

let T >0. Then L (D,T) has a non-empty interior in A4 if and only if dimJT(D) 

x 0 

(x) = n. Moreover, in this case, the interior of L (D,T) is dense in L (D,T), 

X X 

Proof. The main idea in this proof is to modify our problem so that 

we can "keep track" of the time elapsed while we move along an integral 

curve of D. We shall then apply Theorem 3. 1 to the modified system. We 

shall work in the manifold M X R . As in the preceding paragraphs, we 

let the family D* of vector fields on M_ X R be defined by D* = {X : 

X £ L }. It is clear that there is a one-to-one correspondence between 

integral curves a of such that a(0) = x, and integral curves <3(1)*) such 

that 3(0) = (x, 0). This correspondence is given by assigning to each curve 

a the curve t -*(a(t),t). It follows that y e L y (D,T) if and only if (y,T) 

c L. m (D*,T). We show that L (D,T) has a non-empty interior in M 
~ (x, U) x — 

if and only if L ^ ^ (D*) has a non-empty interior in M X R . Assume 
that L x (D,T) has a non-empty interior in M, and let V be a non-empty 
open set such that VCL (D, T). Let X e D^, and let $ be the one-para- 
meter group of diffeomorphisms of M generated by X. Consider the 
mapping F : VXR-* MXR defined by F(v, t) = ( $ (t, v) , T ft). It is 
immediate that the differential of JF has rank r. + 1 everywhere. Therefore 
F maps open sets onto open sets. 
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Since F(VX(0,»))CL, m ( we conclude that L, m (D*) has a non- 
empty interior in MXR. 

To prove the converse, assume that L, m (D*) ^as a non-empty 

interior in MXR . By Theorem 3. 1, for each t with 0 < t < T, L, m 
— — - — — — ~ ^x, uj 

(D*,t) has a non-empty interior in MXR. Let V be a non-empty open 
subset of M. and let W be a non-empty open subset of R such that VXW C 
L . m (D*,t). Let s e W. Since VX { s] <^L. n .(D*,t), we conclude 

^ (X f U) — “ “* — • “ Uj 

that Vd L (D, s). Let .X. e D, and let $ be the corresponding one-para- 
meter group on M. Denote the mapping y -*■ $ (T-s,y) by G. Then 
G(V) is open. Since G(V) is contained in L x (D,T), it follows that 
L (D,T) has a non-empty interior. 

We conclude from Theorem 3. 1 that L (D,T) has a nonempty 
interior if and only if dim ^T(D*)(x, 0) = n + 1. To complete the proof of 
the first part of our statement, we must show that this last condition holds 
if and only if dim^(D)(x) = n. We recall, from the remarks pre- 
ceding this proof, the fact that every X* e S"(D*) can be expressed as 
(#) X* = 2 x. (X.(7) £ ) + Y(+)0 where X.,.. . ,X belong to D 

j X o t l p 


and Y z3T'{D). Now assume that dim S' (D*)(x, 0) = n + 1. Let 
v c M . Then (v, 0) must belong to S’(D*)(x, 0), so that (v, 0) = X*(x, 0), 
where X* Then formula (#) holds for suitable X., X., Y. 

Therefore 


v =(SX.X. + Y) (x), 

and 

£ = s *iTt (0) - 



-75- 


The last equality implies that 2 \ . = 0, so that the vector field 2X .X. + Y 
belongs to (D). Thus e ^l(D)(x). We have shown that .M D )( x ). 

U U X /vr 

Therefore the dimension of .^(D^x) is _n. Conversely, let dim ^j(D)(x) = n. 

Let v c M . Then v e */_(D)(x), so that 
X u 

V = ( 2 X;X. + Y) (x) , 

where the X belong to D, Y and 2X . =0. Therefore, 

(v, 0) = ((SXjX. + Y) ©<ZX.) ^-) (x, 0) 

= .(JCf (+) + Y ©0)(x.0). 

This shows that (v, 0) belongs to ^”(D*) (x,0). Pick an X eD. 

Then XQ-^* (x, 0) belongs to D*(x, 0) by definition, and X @0 
(x, 0) belongs to i7 (D*)(x, 0) by the previous remarks. Therefore 
(0 , rr* (0)) belongs to 3T (D*)(x, 0). We have thus shown that ^ r (D*)(x, 0) 
contains all the vectors (v, 0), v e M , and also the vector (0 , (0)). 

X ot 

Thereforecjr(D*)(x,0) = (MXR ). m , so that dim & (D*)(x, 0) = n + 1 

~ ix, \J) 

as stated. 

We now prove the second part of the theorem. As we remarked 

earlier, there is no loss of generality in assuming that Dis finite. Let 

y e L x (D,T). Using the notations cf the proof of Theorem 3. 1, let 

y = s . (t , x), where i = (i,,...,i ), and where t e R m is such that 
i ^ ~ 1 m ^ 

t . > 0 for i = 1, . . . , m and |j t || = T. Let { s^} ^ (0, t^) be such that 
lim s = 0. Since our condition for L (D,T) to have a non-empty 

k m k x 

tv ^ 00 

interior is independent of JT, we conclude that L x (D,t) has a non-empty 

interior for ail t_ > 0. In particular, for each jc > 0, there exists x 

which belongs to the interior of L (D, s. ). Let t , = (t. , . . . , t . , 

x K ~ k l m- 1 
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t - s, ), and let y, =1. (t , ,x ), For each k > 0, y belongs to L (D, T); 
m k k l ~ k k — k x 

since $. is a diffeomorphism, y, is the interior of L (D,T). Also.x, — x 
l k x k 

as k -► oo because D is finite and s, -» 0. Since $. is continuous in both 

— k l 

variables, and si ice t -* t, we have that y -*y, and our theorem is proved. 

The results of the previous theorems can be utilized to obtain inform- 
ation about the sets L (D,T) and L (D,T), even when dim^D)(x) < n. 

'v* X X ““ 

THEOREM 3. 3. Let D <Z V(M) be a family of complete vector fields. 
Then, for each T > 0, the set L (D,T) is contained in I(D,x). Moreover, 
in the topology of I(D,x), the interior of L (D,T) is dense in L(D,T). 
L x (D,T) has a non-empty interior in I(D,x) if and only if dim«^(D(x)) = 
dim^D)(x) and, in this case, the interior of L (D,T) is dense in L (D,T). 

Proof . If X c ^"(D), then X. is tangent to I(D,x). Thus, there is a 
well-defined restriction_X # of X to I(D,x). We denote the set of ail such 
restrictions of elements of D by D#. Since (X, Y]# = [X#, Y^], it follows 
that ^(D)# = ^"(D#). Analogously, we have that .^(D)# = c^(D#). If 
we now apply the previous theorems to the family D# of vector fields in 
I(D,x), we get all the conclusions of the theorem. 

COROLLARY 3. 4. Let SI be a maximal integral manifold of c^(D). 
Then the dimension of ^q(D)(x) is the same for all jc e _S, 

Proof. If dim#(D) (x) = k then, for each x e S, the dimension of 
^ 0 (D)(x) is either k or k-1. We show that, if dim ^(DHy) = k/- 1 for 
some x e S, then dim ^q(D) (y) = k-1 for all y e S. T et be a non- 
empty, open (relative to _S) subset of L (D) (this is possible by Theorem 
3. 3). We first show that, if y e £2 , then dim «^(D)(y) = k-1. If this 
were not the case, then necessarily dim^(D)(y) = k. Then Ly(D,t) would 
have a non-empty interior in S for all t_ > 0 . This would 
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imply that L x (D,t) has a non-empty interior in _S. But by our assumption 
this is impossible. Thus, dim l y'^(D)(y) = k -1 for allj£ e Since _S 
is connected, and £7 is open in _S, we have that dim .^(DJfy) = k-1 for 
all v e S ; therefore, our statement is proved. 

We now proceed to study the case when dim (D)(x) = dim 
^(D)(x) - 1. We begin by proving some preliminary lemmas. 

LEMMA 3. 5. Let D d,' T { M) be a family of ^^mplete vector fields. 
If X e_D^, let { $ be the one-parameter group generated by X. Then, 
for every x e JVI, and every e R the differential ds maps (D)(x) 
onto «^ 0 (D)($ t (x)). 

Proof . We first show that for every ^ e M there is an r > 0 such 
that, if v e^ 0 (D)(y), then d$ t (v) e ^ 0 (D)($ t (y)) for all t_ with |t| < r. 

It is sufficient to show that for every £ e M and every v e«^Q(D)(y) 
there exists an r > 0 such that dt ^(v) e «Fq(D)(# ^(y)) for all t with 
1 1 1 < r. Let v c _M, and let v e (D)(y). If v = Y(y) for some 
Y e (D), then an easy computation shows that there exists a neighbor- 

V ^ 

hood of t = 0 such that d$ (v) = 2* — [X^, Y] («,(y))t* for all 

t_ in this neighborhood, where [X^,Y] = Y, and [X^ n \ Y] = [X,[X^ n ”^, 
Y]] for n = 1,2,... Since each term of the above series belongs to 
& q(D)($ ^(y)), we have that d$ ^(v) e «^Q(D)(§ t (y)) for t_ sufficiently small. 
Also, for such t_ we have that d$^ (^(D)(y)) = ^Q(D)(f ^(y)); this is so 
because d# is one-to-one, and dim ^'^(D)(y) = dim «^(D)($ ^(y)) 

(Cor diary 3.4). It follows easily that the set of all t^ such that 
d W D)( x )) = tTg(D)($ ^(x)) is both open and closed. If v e «^'q(D)(x), 
we can conclude that (v) c ^(DJd^v)) for all t. This completes 
our proof. 
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As we remarked earlier, ,^(E>) is a subalgebra of.^”(D). We will 
denote the maximal integral manifold of ^q(D) through x by I^(D,x). 

If X e D then, by the previous lemma, $^(Iq(D,x)) is a maximal integral 
manifold of 

LEMMA 3. 6. Let D CZ V(M) be a family of complete vecto. fields. 
Let 21 and Y be elements of J), and let { } and fy^} be their correspond- 

ing one-parameter groups. I' S is a maximal integral manifold of 

./JD) then, for any t e R , $ (S) = y (S). 

u — ^ t t 

Proof . Let2£».X> and S satisfy the conditions of the lemria. 

Let P be the maximal integral manifold of^(D) which contains S, If 
dimJP = dim S, then S = P, and $ S) = S = y fc (S). Assume that dim 3 = k 
dim (P) - 1. We first show that there is an £ > 0 such that f ^ (S) = y^(S) 
whenever | t | < r. Let x e_S. The mapping (s,t) -•> $^(s) has rank k + 1 
at (x, 0). Let U be a neighborhood of x in S, and let 6> 0 be such that 
this mapping, restricted to f2 X ( - p, ft ) is a diffeomorphism onto an open 
subset of JP. If efl#, let and f|jr) be such that $f(y)( s (y)) = Y * 

Clearly, f_ is analytic in fi#, and f(y) ~ 0 if ^nd ‘-\ly if e ft . Mr . . over 
Xf = 1 in Q#. For every t_ such that |t| <6, the net is r\n 

integral manif lid of ^q(D). The vector field - X is tangent to 
and, since f^ is. constant on it follows that Yf = Xf on Since 

is the union of the sets f (f2) over -ft < t < ft, we conclude that 
Yf = Xf = 1 on Q#. Let r > 0 be such that the curve t -*■ J Yj.(x)). de- 
fined on - r < t < r, is contained in f2#. In addition, let »•< ft. Let g(t) = 
f($ ^.( y , ( x) ) . Then jj is analytic in (-r, r), and moreover ^(t) = f(f t (x)) - t. 
We have that q' = (Yf) (y^(x)) -1*0 and, since g(0) =0 it follows that 
g = 0 on (-r, r). But this means that $ S ^ 

for all t e (-r, r), Hence, if J t | < r, the manifold 
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$ ( Y (S)) intersects j5. Since $ ^(y^(S)) is a maximal integral manifold 
of it follows that $ (S)) = S, and tnat y^(S) = ?^(S). Let A^ 

be the set of all t_ such that$ T (S) =Y^(S) for all t in a neighborhood of 
t_. Then A is obviously open, and we have shown that s A. It follows 
easily from the preceding argument that A is closed. Therefore, 

$^(S) = y^(S) for all real .t, and our proof is complete. 

According to the above lemma, if_D£ V (M) and if x e M_, then 
the manifold $ ,(I (D,x)) depends only on t_, and not on the particular 
choice of X . We shall denote this manifold by I* (D,x). It is clear 
that I*(D,x) could be defined as the maximal integral manifold of 
cF_(D) passing through y, where y is an arbitrary point of L (D,t). 

Finally, we prove a factorization property of maps that will be 
utilized several times. 

LEMMA 3. 7. Let E be a locally convex vector space, let 
K CJL* ant * £1 be a convex dense subset of K. Let F : K -»-I(D, x) 
be a continuous mapping such that F(C) is contained in a maximal 
integral manifold J5 of JB). Then F(K) is contained in S, and F , 
as a mapping from K into S, is continuous. 

Proof . If dim S = dim I(D,x), then S = I(D,x), and the conclusion 
follows trivially. Therefore, we shall assume that dim S = dim I(D,x) -1. 

Let k e K, let X e I), and let be the one-parameter group 

induced by Then, as in the proof of Lemma 3. 6, we can find a 
neighborhood f2 of F(k) in Ig(D,F(k)), and a positive number 6, such 
that the mapping (s,t) -*# t (s) is a diffeomorphism of X (-6, 6) onto 
an open subset SI# of I(D,x). Let 1J be an open convex neighborhood of 
k such that F(UOK) O For each (-6, 6), the set (Q) is an 
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integral manifold of «^(D); therefore, if intersects JS , then 

♦ (Q) is contained and open in S. Let A = {t : It I < 6 , # (£2) O S] . 

It follows that S O 12# is the union of the sets |^(12), t_ eA. These sets 
are mutually disjoint and, since _S is separable, it follows that A is at 
most countable. Let y -*• ( s(y) , f(y)) be the inverse in 12# of the map 
(s,t) -*■ $^(s). Then the function £ defined in Ufl K by g(m) = f(F(m)) 
is continuous. Since F(U HC)C S A £2#, we conclude that j*(m) c A 
for all m_ e U H £. But^ is at most countable, and IJ fl is convex; 
therefore is constant on _U 0 C . Since IJ A C_ is dense in U_ A I£, we 
have that£ is constant on IJfl K. Obviously g(k) = 0, and therefore 
g(m) = 0 for all m g IJ fi I£; thus F(m) e 12. This shows that 12 con- 
tains a point of JS; hence 12 CS, and F(k) g §. This proves the first 
part of the lemma. 

To prove the second part, let {k^} C K converge to k. Since _F 
is continuous, Ffk^) -♦ F(k). For large n, s(F(k n )) is defined. Since 

i 

£ is continuous, s(F(k n >) converges to s(F(k)) in_S. But g(k R ) = 0, and 

therefore s(F(k )) = F(k ). Similarly, s(F(k)) = F(k). We have thus 
n n 

shown that F(k^) converges to F(k) in J>, and our proof is complete. 

Remark 3.8. It is clear that the preceding lemma is valid under 
weaker assumptions about £ and K. For instance, it is sufficient to 
assume that, for every 1c g K and for every neighborhood U of Jk, 
there exists a neighborhood V of k such that V £ t£ and V f) C is 
connected. 

We now state and prove the theorem towards which we have been 


aiming. 
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THEOREM 3. 9. Let _D_ (2 V(M) be a set of complete vector 

fields, and let x $ M . Then, for each T > 0, L (D,T)o I*(D,x) and, 

' ^ X U 

moreover, the interior of L x (D, T)(relative to I*(D,x)) is dense in L^fD.T) 
(and is, in particular, non-empty). 


Proof . If dim /^(D)(x) = dim <y"(D)(x); then we have from Corollary 

3.4 that «/g(D)(y) = <*"(D)(y) for ail y e I(D,x). Therefore, Iq(D,x) = 

I^(D,x) and our conclusion follows from Theorem 3. 3. Assume that 

dim ^"q(D)(x) = k = dim t ^'(D)(x) - 1. It is clear from Lemma 3. 6 that, 

T 

if a is an integral curve of JD such that a(0) = x » then a(T) e I^(D,x); 
hence, L x (D,T) C lJ(D,x). 

We now show that, if y e L (D,T), then^ is the limit of points 
which belong to the interior of L (D, T). Let D = [X. , . . . , X } and 

X 1 K 

let y = $ . ( T , x) , whe re || T || = T , and T.>0 for i = 1 , 2 m ( the 

'V' 

notations here are the same as in the proof of Theorem 3. 1) . Let 


j = (j. , . . . . j ) be an ^-tuple of integers between 1 and k such that the 
rank oft . (t , x) is equal to dim t 7’(D)(x) for all t in an open dense 

y*S J J 

subset ft of R s . Let ft* = {t:teR S , t.>0 for i = 1, . . . , s} O ft. 

«V **W X 

Let {t p} CZ. ft be a sequence that converges to 0, and let T^ = (T . . . , 

T »,T - lit II ). We can assume that || t || < T for all p > 0. If 

m- 1 m 'LL p 11 11 p '• m 

we let y = f . (T , a .(t ,x)), then y e L (D,T). We next show that 

7 p i ~ p * j ~ p p x' 

~ ~ T 

y is in the interior of L (D,T) relative to I (D,x). Since the 

p * H£ I' 

mapping z , z) is a diffeomorphisrr from I. ~ " (D,x) onto 

1 ~ p u 

fjt ^ 

I 0 (D,x), it suffices to show that $ .( t , * -S in the interior of L (D. lit 
^ J ~ P x v ’ >L,p 

Let V = { t : t e R S , t. > 0, . . . , t > 0, || t || = || t ||} . Clearly, if 


t e V , then «. (t,x) e L (D, II t ||). Let F :V -I '~pH )(D,x) be 
~ p ®j ~ x p p 0 

defined by F (t) = $ . (t ,x). We show that F is analytic. Since F 
P ~ J ~ P P 
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is analytic as a map from V into I(D,x), it suffices to show that it is 

P 

continuous. But this follows from the previous lemma, because V is 

P 

convex. The rank oft -► 4.(t,x) is equal to dim^(D)(x) at t = t . Since 

J 0^0 

V is a submanifold of R of codimension 1, it follows that the rank of 

P ~ lit !| 

F at t is equal to the dimension of J A (D,x). Thus, F (V ) con- 

p ~p 0 p p 

tains a neighborhood of F (t ) in I n ''~P'' (D,x). It follows that 

p ~ p u 

4.(t , x) is in the interior of L (D, lit II ). By the previous remark 

j ~ p X u ^p 

~ T 

we conclude that y is interior to L (D,T) in I. (D,x). There remains 

p X u 

T 

to be shown that y converges to y in I (D,x). The mapping (t , s) — 

p u ^ *** 

$ . (t , $.(s,x)) is continuous as a map from R m XR — into I(D,x). The set 

V = { (£,^) : t. > 0, s.>0,i = l m, j = l s, || tj| + II S \\ ~ T} 

T 

is convex, and is mapped into 1^ (D,x). Therefore, the previous lemma is 

T 

applicable, and we conclude that y -*• y in I (D.x). This proves our 

P * 

theorem. 


4. Applications to Control Systems 

We shall consider systems of the form 


=F(x(t),u(t)) 

defined on an analytic manifold M. The functions a belong to a class 
‘Wof "admissible controls". We make the following assumptions about 
Qi and the system function F: 

(i) The elements of^/are piecewise continuous functions defined 
in [0,®), having values in a locally path connected set f2 C. R m 
(O is locally path connected if, for every u e and every neighborhood 
_V of cj, there exists a neighborhood IJ of (0 such that U V, and U 
0 O is path connected). In addition, we assume that contains 
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a ll the piecewise constant functions with values in Q , and that every 
element of <%i has finite one- side limits in each point of discontinuity . 

^ is endowed with the topology of uniform convergence on compact 
intervals. 

(ii) F : M_X 57 -♦ TM is jointly continuously differentiable. For 
each u e 17 , ( * , _u) is a complete analytic vector field on M. We k now 

that for each x e M . , the differential equation 

^)p- =F(x(t),u(t)) x(0) = x, (1) 

has a solution defined for all t^ c [0, *) , where * > 0. We denote such a 
solution by fl (x, u, • ) , and we assume that TT (x, u,J:) is defined for all 
t_ e [ 0, <4 . 

For the above defined control system, we now state the basic 

controllability concepts. We say that e M is attainable from x g M 

at time _t ( t 2 0) , if there exists u c such that tt(x, u, t) = y. For each 

x e JM, we letAjx.t) denote the set of ail points attainable from x at 

time t . If 0 ^ t < <» , we define A (x, t) ={Ja (x, s) and A(x) = 1/ A 

s^t t*0 ~ 

(x, s). We say that the system is controllable from x if A(x) = M, and 

* * ■ M ■ ■ — ■ ' /-w 

that it is controllable if it is controllable from every x c M. We say 
that the system has the accessibility property from x if A(x) has a non- 
empty interior, and that is has the accessibility property if it has the 
accessibility property from every x c M. Finally, we shall say that the 
system has the strong accessibility property from x if A(x, t) has a non- 
empty interior for some t > 0, and that it has the strong accessibility 
property if it has the strong accessibility property from x for every 


x e M. 
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For to e , let X = F( • ,u); from assumption (ii) it follows that 

Cl) 

X is a complete analytic vector field -on M. Throughout the remaining 
part of this article we let D = (X : w . 

LEMMA 4. 1. For each x c M , A (x) is contained in I(D,x). 

The proof is identical to that of Lemma 2. 4, and will, therefore, 
be omitted. 

Remark 4. 2. It is easy to see that the control system defined by 
restricting F to I(D,x) satisfies the same assumptions as the original 
system. Since it can be readily verified that the map u -*• H(x, u, t) is 
continuous as a map from into M, it follows that this map is also 
continuous as a map from into I(D , x) . 

We now want to obtain a result for A^x, t) which is similar to that 
of Lemma 4. 1. It is here that the assumption about Q. will be utilized. 
Let iPbe the class of piecewise constant valued functions defined on 
[0,»). Clearly, is dense in ^ . Moreover, the local connectedness 
of £2 implies that the condition of Remark 3. 8 is satisfied (this can be 
easily v rified, and we omit the proof). Thus, we can apply Lemma 
3.7, with C =^and K =4!/, to obtain the following result: 

LEMMA 4. 3. Let x e M_. For each t a 0, A(x, t) OI*(D, x). 

Proof . Since ^contains we have that L x (D,t) C- A(x,t). Let 
G : fyl -*I(D,x) be defined by G(u) = TT(x,u,t). We have that G^) = L 

X 

(D,t) and by Theorem 3.9, G(^) CZ Iq(D,x). Now our conclusion 
follows immediately from Lemma *3. 7, and the proof is complete. 

The above lemmas combined with the theorems of the preceding 
section yield the following results: 

THEOREM 4. 4. Let x eM. Then A (x) C I(D, x). Moreover, 
for every T > 0, the interior of A(x,T) relative to'I(D,x) is dense in 
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A(x,T)(and, in particular, is non-empty). 

Proof . The first part is just the statement of Lemma 4. 1. To prove 
the second part, we can assume that I(D,x) = M (if not, replace the 
original system by its restriction to I(D,x), cf. Remark 4. 2). Since 
L (D,T) is dense in A(x,T), our conclusion follows immediately from 
Theorem 3. 1. 

THEOREM 4. 5. Let x e jM. Then, for each t > 0, A(x,t) Cl 
Iq(D,x) and, moreover, the interior of A(x, t) relative to Iq(D,x) is 
dense in A(x, t) (and, in particular, is non-empty). 

Proof . The first part is just the statement of Lemma 4. 3. To prove 
the second part, we apply Lemma 3. 7 to the function G of Lemma 4. 3, 
and we get that G is continuous as a map into I^(D,x); therefore, 

L (D,t) is dense in A(x, t) relative to ljl(D,x). Our conclusion now follows 
immediately from Theorem 3. 9, and the proof is complete. 

The following two controllability criteria follow immediately from 
the Theorems 4. 4 and 4. 5, and from Lemma 2. 1; , 

COROLLARY 4. 6. The system has the accessibility property 
from x if and only if dim^(D)(x) = dim M. In this case A(x,T) has a non- 
empty interior for every T > 0. 

COROLLARY 4.7. The system has the strong accessibility 
property from x if and only if ^q(D)(x) = dim M. In this case A(x,T) has 
a non-empty interior for every T > 0. 

The preceding results can be utilized to derive relationships 

* 

between accessibility and strong accessibility. Even though the latter 
property seems much stronger than the former, we show that, for a very 
large class of manifolds (including the spheres S n for n> 1, and all com- 
pact semisimple Lie groups, but not R 0 ), it is in fact implied by it. 
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On the other hand for a still larger class of manifolds (including R ri ) 
controlability (which trivially implies accessibility), is sufficient to 
guarantee strong accessibility (the fact that controllability implies 
that dim ,^(D*)(x) n n + 1 for all jc was proved by Elliott in [5]), 

Consider a system on a connected n-dimensional analytic 
manifold M, having the accessibility property but not having the strong 
accessibility property. Let Dbe the family of associated vector fields. 
By Corollary 4. 6, dim#"(D)(x) = n for all x e M. By Corollary 3. 4 the 
number dim ^T^(D)(x) is independent of x. Since this number is either 
n or n -1 , Corollary 4. 7 implies that dim ^g(D)(x) = n - 1 for all 
x e M. Choose a fixed X e D, and use $ to denote the one-parameter 
group generated by X(i. e. , for every y c M, the integral curve of X 
that passes through y at t = 0 is the curve t -* f (y)). Define a mapping 
F from the manifold SXR into M by 

F(s, t) = $ t (s). 

One shows easily that F is a local diffeomorphism onto M. 
Moreover, SXR is connected. In fact, we have (cf. [18], Ch. 2, for 
the definition of a covering projection): 

LEMMA 4. 8. The map F is a covering projection. 

Before we prove Lemma 4. 8, we show how the results mentioned 
above follow from it. 

THEOREM 4. 9. Let Mbe a manifold whose universal covering 
space (cf. [18]) is compact. Then every system having the accessibility 
property has the strong accessibility property. 

Proof. If the universal covering space of M is compact, then 
every covering space of M is compact. If it were possible to have 
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a system on _M_ having the accessibility property but not the strong 
accessibility property, we could define, for such a system, Sand F 
as above. It would follow that SXR is compact, which is clearly a 
contradiction 

Remark. If n > 1, the sphere S n is simply connected (and compact). 
Therefore Theorem 4. 9 applies. Also, if _M. is a connected compact 
semisimple Lie group (for instance SO(n), if n > 2) , the universal 
covering group of M is also compact (cf. [7], p. 123) and, therefore. 
Theorem 4. 9 applies in this case as well. 

THEOREM 4. 10. Let M_ be a manifold whose fundamental group 
has no elements of infinite order. Then every controllable system on 
M has the strong accessibility property. 

Proof . A controllable system obviously has the accessibility 
property. Assume it does not have the strong accessibility property. 
Define S and F as before. We show that F is one-to-one. Otherwise, 
there would exist s^, s q e S and a T N such that F(T, s^ ) = $ ^(s^) = 

F(0, s A ) = s . Therefore * ™(S) = S. Define H: SXR -»> SXR by 
H(s,t) = (* ^(s), t-T). Then H_ is well defined, because » ^(S) = S, and 
is a homeomorphism. Moreover, if (s,t) e SXR 

F(H(x,t)) =$ t _ T U T (s)) =* t (s) = F ( s , t) . 

Therefore H is a covering transformation (cf. [18], Ch. 2). 
Moreover, H has infinite order, because H m (s, t) = (J' m -j.(s), t-mT), 
so that H m is not the identity map if m ^ 0. We know from [18] Ch. 2 
that the group of covering transformations of the covering space 
(SXR, F) is isomorphic to a subgroup of the fundamental group nof M. 
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If rrhas no elements of infinite order, then this is a contradiction. 
Therefore F must be one-to-one. On the other hand, the points that 
are attainable from x^ must belong to S f (= $^(S)) for some nonnegative 
(cf. Theorem 4. 5). Therefore the points in S are not attainable, 
if t > 0. Thus, the system is not controllable, and we have reached a 
contradiction. 

t. 

Remark . Theorem 4. 10 applies, in particular, to any simply 
connected manifold, such as R n . 


Proof of Lemma 4. 8 . We must show that every point of M has a 
neighborhood that is evenly covered by F. Let m e M. Since F is a 
local diffeomorphism onto, there exist s e S, t eR , e> 0 and a connected 
neighborhood U of ji in S such that F(s, t) = m and that the restriction of 
F to UX(t-e, t + g ) is a diffeomorphism onto an open subset V of M. 


We claim that V is evenly covered. Let A = {t : $^(S) = S } . For each 
r e A, let U T = $ T (U). Since 4 t : S S is a diffeomorphism, it follows 
that is open in Jj and connected for each r e A. We first show that, 
if 0 < | r - nj <2 e, r e A, ne A, then and l) are disjoint. Assume 
they are not. Then 4 ^(U^.) and 4,p(U^ are not disjoint, for any T . Choose 
T such that both T^ + Tand1[+r| belong to (t-e, t + e). Let u = ^^ +T ( u j) = 


4 rp ■ (u,) be a common element, where u. and u~ belong to U. Then the 
1 + T\ d Id — 

points (UjT + t) and (u^.T + r|) belong to UX(t-e,t+g). Since the re- 


striction of F to this set i* one-to-one, it follows that t = p, which is 


a contradiction. For each t eA, let W = U X (t-r- e, t-r+g). We shall 

T T 

conclude our proof that V is evenly covered by showing: 
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(a) the sets W^are open, connected and pairwise disjoint, 

(b) for each t e A, _F maps W diffeomorphically onto V, and 

A 

(c) the inverse image of V under F is the union of the sets W . 

/ — — T 

The first two assertions of (a) are obvious. If t and m belong to 

A , and r t p , then either j t - n | < 2 e or | T-t] | ? 2 e . In the first 

case W and W must be disjoint, because U and U are disjoint. 
t n t r\ J 

In the second case, W_ and W are also disjoint, because the intervals 

7 r\ J 

(t-T- e , t-r + e) and (t-rf s, t- r|+ e) cannot have a point in common. 

T o prove (b) , take t e A. Define G: UX (t-e,t + $ -*■ by 
G( u , ,- t ) = ( » T (u), a - T ). Clearly, G is a diffeomorphism from UX(t- e , 
t + e ) onto W . Moreover ifueU, t - e < a <t + e, then F(G(u,cr)) =$ ($ ( u)) 

' (J-T *T 

$ (u) = F( u, /^ ) . Since the restriction of F to UX(t-e , t+e) is a diffeomor- 

n 

phi sm onto _V, the same must be true for the restriction of F to . 

Finally, we prove (c). Let u e S, <y e R be such that F(u,^) e V. 



Then there exist u : e U, © ' e (t-e , t + e ) such that F(u\ = F(u, o). 

Therefore u = $ , Ju'). This implies, in particular, that t = o ' - 0 

belongs to A_, and that u e . Moreover, since t - e < o 1 < t + e, 

it follows that t - r - e <9 < t - T+e. Therefore ( u, a ) e W . 

T 

The proof Lemma 4. 8 is now complete. 

5 . Examples . 

Example 5. 1. Let M = R n , Q = R m , and let F : M X -> TM 
be defined by F(x, u) - Ax + Bu, where A and 3 are, respectively, 
n X n and n X m real matrices. We have that D = {A(‘ ) + Bu : u e R m }, 

Let b. denote the i_-th column of EL Then, as shown by Lobry [16], 

^■(D)(x) contains the vectors: 
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n- 1 

Ax +b. , + Ab + A b. i = 1 , . . . , m. 

— i — 1 — i 

It is not difficult to see that the above set of vectors forms a 
system of generators for ^"(^)(x). From Corollary 4. 6 we gel that 
A(0 , t) has a non-emoty interior in P n if and only if { J rb., +Ab 

~ ~ — i — X 

n- 1 

+ A b. , i = 1,2,..., m} has rank n ; equivalently. A(0 , t) has a non- 

— I J ~ ^ 1 11 


n- 1. 


empty interior in R if and only if ran k [B,AB, . . . ,A B] = n. 

Since, obviously, (D)(0 ) =^T(D)(0), we conclude thatA(O.t) 
has a non-empty interior whenever A(0,t) does. Tiie above statements, 

/-V-/ 

along with the fact that A(0, t) and A(0, t) a: linear subspaces of R n , 

n- 1 

imply that, if rank [B, AB, .... A B] = n, then for each t > 0 
A(Q,t) =A(0,t) =A(0) = R n (Kalman [l2]). Thus, in this example, the 

/NJ <V 'V/ «S/ 

accessibility property is equivalent to controllability. Th’*s is, of 
course, not true in general. 

Example 5. 2. Let M = R n , = f u e F m : u. 5 1, i=l,..., m}, 

and lei F(x, u) = (A a + 2 m A .u.)x for all (x, u) e R n X ft, where A^, . . , 

0 i = l ii — — ~ 0 


A are nXn real matrices. Then D is the set of all vector fields X 


m — u 

where X (x) = (A_ + u.A.) x. The set M n of all nXn real matrices 

u 0 i=l i i ~ 

is a Lie algebra, with the bracket defined by [P,Q] = PQ - QP. To 
each matrix P there corresponds a vector field V(P) defined by V(P)(x) = 
Px. It is easy to check that V([P,Q]) = [V(p),V(Q)]. Using this fact, 
the spaces iF(D)(x) and tF^(D)(x) can be readily computed: 


J-(D)(x) = fPx : P S L] , 


and 


# 0 (D)(x) = {Px: P s L } 
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where L is the Lie algebra spanned by A_, . . . , A , and L is the ideal of 

L spanned by A A . We remark that for this example the theory 

of Section 4 is valid even if <4/ i s the set of all bounded and measurable 

Q - valued function s. This is so because the only properties of the 

class of admissible controls that were utilized in Section 4 were: 

(a) that the class of piecewise constant controls is dense in ^ (in the 

topology of uniform convergence), and (b) that, if fu } are elements of 

1 

that converge uniformly to _u, then IT(u ,x,^) converges to TT^u,x,t). 

a " 

In our example, both (a) and (b) remain valid if the topology of 

a 

uniform convergence is replaced by that of weak convergence. This 

is easy to verify, and we shall not do it here (see Kucera [14]). 

Moreover, the set of f^-valued measurable functions defined in [O.T] 

is weakly compact. It follows that the sets A(x,T), A(x,T) are com- 

pact for each T >0. Denote their interiors (relative to I(D,x) and 
T 

I (D,x) respectively) by Int A(x,T), Int A(x,T). It follows that 
A(x,T) is the closure of In*- A(x,T), and that A(x, T) is the closure of 
Int A(x,T). Therefore, our results contain those of Kucera (in this 
connection, see also Sussmann [21]). 

Remark . The result of the preceding example is a particular 
case of a more general situation. Let G_ be a Lie group, and let M 
be an analytic manifold on which G acts analytically to the left. 

Then there is a homomorphism X from the Lie algebra of G into 
V(M), defined by 

X(X)(m) = (exp (tX) • m), 

the derivative being evaluated at t_ =0. If X^, . . . , X^ belong to 
the Lie algebra o' G , we can consider the control problem 
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dx 

dt 



k 

+.2. u.X'.(x), 

i = l i i 


where X! = \(X.). Example 5. 2 results by letting G = GL(n, R) and 
11 ~ 

M = R . 

/v; 

Example 5. 3. This example shows that the analyticity assumptions 
are essential. Consider the following two systems defined in the (x,^r) 
plane: 

(Sj) x = f j(x, y, u) 

V = gj( x » Yf u ) 

and 

(S 2 ) x =f 2 (x, y, u) 

y = g 2 ( x .y.u) 

Let f .ae. f, =. 1 , g s 0, and g_(x, y, u) = cp (x) where cp is a C°° function 
which vanishes for -»<x < 1, and which is equal to 1 for x > 2. It is 
clear that for (Sj) the set A((0, 0)) is the half line {(x, y) : y = 0, x a 0} 
while, for (S.,), A((0, 0)) has a non-empty interior. However, both systems 
are identical in a neighborhood of (0,0). 
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CONTROL SYSTEMS ON LIE GROUPS* 


Velimir Jurdjevic** and Hector J. Sussmann*** 

1. Introduction 

In this article we study the controllability properties of systems 
which are described by an evolution equation in a Lie group Ci of the 
form: 

(♦) If - = -0 ,X,l)) + 2 f=l-i (i) -i (iy)) 

where Xq . . . . »2£ m are right- invariant vector fields on G. Systems 
described by (+) we term right- invariant. This study is based on the 
results of [11], and is related to the work of R. W. Brockett [ 1 ]. As 
remarked by Brockett, there are many important applications in 
engineering and in physics which are not treated by classical control 
theory because of the assumption that the state space is a vector space. 
In particular, when controlling the orientation of a rigid body relative 
to some fixed set of axes, the state space can be naturally regarded as 
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SO(3), the group of orthogonal 3x3 real matrices with positive determinant. 
The evolution equation describing this system is of the form given by 
(+) ( [ 1 ])- Instead of restricting our study to groups cf matrices, we con- 
sider systems described in an abstract Lie group G. This generalization 
in no essential way affects the nature of the problem. 

From the theoretical point of view a study of systems of the form 
(+) appears natural for several reasons. For instance, the algebraic 
criteria developed in [ll] can be used to obtain global results by exploit- 
ing the algebraic structure of the state space and the sets attainable from 
the identity. In this regard, the analogy with the controllability of linear 
systems is striking. 

In this article we shall look for necessary and sufficient conditions 
for a right- invariant system to be controllable. A necessary condition 
is that the system have the "accessibility property" (cf. [ll]). We show 
that this condition is also sufficient if G is connected and if either (a) the 
system is homogeneous (i. e. Xq = 0) or (b) G is compact. When neither 
(a) nor (b) hold, accessibility (plus the connectedness of G) is not suf- 
ficient for controllability. In this case we give some sufficient conditions, 
and a necessary condition, and we single out a particular situation in 
which a necessary and sufficient condition can be obtained. 

An obvious necessary condition for controllability is that the set 
A(e) of points reachable from the identity of G be a subgroup of G. Thus, 
the controllability problem reduces to the following two: 

(a) when is A (e) is a subgroup? , and 

(b) if A (_e) is a subgroup, when is A (e) = G? Question (b) is much 
easier to answer than question (a). In Theorem 4. 6 we show that if A^e) 
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is a subgroup then, necessariLy, this subgroup is the connected Lie 

subgroup S of G whose Lie algebra is the subalgebra L generated by 

X ,. . . ,X . From this it follows that the system will be controllable if 

. and only if ( 1 ) A(e) is a subgroup, (ii) G is connected, and (iii) L is the 
^ ^ 

Lie algebra of G. This shows that the crucial question is that of determin- 
ing when A(e) is a subgroup. 

This question is (partially) answered in Sections 5 and 6. 

The organization of the article is as follows : In Section 2 we intro- 

duce notation and basic concepts: in addition, we quote a result about 
Lie groups which will be used Leer. In Section 3 we single out the 
relevant Lie algebras induced by a right in Tiant system. In Section 4 
we derive the basic properties of attainable sets. In Section 5 we study 
the homogeneous case, and in Section 6 we study the general case. In 
Section 7 we interpret our results in terms of controllability. 

Finally, Section 8 contains examples. 

2. Preliminaries 

We shall assume that the reader is familiar with the basic facts 
about Lie groups (cf. [2], or [4], or [5]). 

Throughout thi3 paper, G will denote a Lie group, and Ij(G)will 
denote the Lie algebra of G. We shall think of L(G) as the set of vector 
fields on G that are invariant under right translations. It is known 
that every X e L(G) is analytic, and that L(G) is a Lie algebra with the 
obvious vector operations, and with the Lie product defined by 
[X,_Y] = X Y - Y X 

The exponential map from L(G) into is denoted by exp. Recall 
that exp(0) - e (the identity element of £), and that, for each X e Ij(G), 
the curve -* exp (^X) ' 8 an * n teg r al curve of X. 
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We recall that there is a one-to-one correspondence between Lie 
subalgebras of L(G) and connected Lie subgroups of G. If H is a con- 
nected Lie subgroup of G, the Lie algebra L(H)is naturally identified 
with a subalgebra of L(G). We shall also denote this subalgebra by 

L(H). 

Let X X be elements of L(G). We shall consider the 

following control system defined on G: 

(1) x(i) =X 0 UU)) + r2 u.(t) x.isU)) 

where u = (u ,, . . . ,u ) belongs to the class of admissible controls U. 

Throughout the article we shall assume that IJ is one of the classes 

U ,U or U, , defined as follows: 

— u — _r — b 

(i) is the class of all locally bounded and measurable functions 
defined on the interval [0,«) having values in R— . 

(ii) U[ r is the subset of tJ^ consisting of all elements which take 

values in the cube {x e : |x. j - 1, i = 1 m). 

(iii) is the class of all piecewise constant functions defined on 
[0,«») with values in R — such that the components of its elements only 
take values 1 and -1. 

We will refer to U , U and U, as the class of unrestricted, 

—u — r_ — b — 

restricted and "bang-bang " controls, respectively. 

If X = (X n , . . . , X ) is an m + 1 - tuple of elements of L(G), and 

~ — 0 ni — 

if U is a class of admissible controls, then the system described by equa- 
tion (1) will be termed right- invariant. For notational convenience, we 
will denote such a system by PC, U). We will also adopt the convention 
that if in a particular statement IJ is not specified explicitly, we will 
mean that such a statement is true for any class of admissible controls 
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(i. e. , U , U or II ). 

— u — _r — b 

We have the following basic fact: 

LEMMA 2. 1. Let (X, U) be a right -invariant system on G, and l et 
u c U. Then for every g c G, there exists a unique solution ^ x of ( 11 
defined for 0 < t < « , such that x( 0) = £. 

Proof . Uniqueness and local existence follow from the standard 
results on ordinary differential equation. Moreover, we know from these 
results that there is a m aximal interval [0,T) (T >0) on which there 
exists a solution x of ( 1) withx(O) = jr. We show that =» . Assume 
T^<<®. Let jr(t) be a solution of ( 1) defined for - 6<t^ < TT + 6, where 
6 > 0, and such that jr(T) = e. Let g_' = jr(T - 6 ), = x(T -■£*). 

Let z (t) be defined by 

_z( t) = x(jy for 0 ‘ | 6 

z(t) =_y(t) £' for T - j 6 <J < T + 6 . 

Than j;(t) is a solution of (1) which satisfies zj(0) = £and is defined for 
0 ^ t< _T + f) . This contradicts the maximality of the interval [0,_T). 
Therefore TT = ® and our proof is complete. 

If u £ IJ and £ eG we will denote the solution x of (1) which satisfies 
x(0) = £ by tt (g, u, • ); i. e. , x(t) = n(_g,_u,l) * or all t, e [0,»). If, for some 
t a 0, n (^,u, t) = , we say that the control u steers £ into g 1 in t 

units of time . If there exists ae U which steers £ into £' in _t units of 
time , we say thatj* 1 is attainable (or reachable ) from g at time t. The 
set of all £’ c G which are attainable from £ at time t_ will be denoted by 
A(g,t). We shall also use the notations 



- 100 - 


A(l.J) 

A(*T) 

We shall refer to A(g) as the set attainable from g. 

— * r ~ 1 1 r 

From the right invariance of our control systems it follows 

(21 

trivially that A(_g,T) = A(e,JT)£, A(g,T) = A(e,T)j», and A(g) = A(e)g_ v 
Therefore, without loss of generality, we can limit ourselves to the study 
of the sets attainable from the identity . 

We finish this section by quoting a result about Lie groups whose 
proof can be found in [12] (cf. - also [5], pp, 27 5). 

THEOREM 2. 2. Let G be a Lie group, and let H be a path-connected 
subgroup of G. Then H is a Lie subgroup of G. 

3. The Associated Lie Subalgebras . 

To every right- invariant control system (X,_U)on a Lie group G, 
we shall associate the following three Lie subalgebras of L(G): 

( 1) The subalgebra L generated by X^,. . . *2£ m * 

(2) The ideal of L generated by X. , . . . ,X . This ideal will be denoted 

~ — *1 — -m 

b yfc o- 

(3) The subaigebra h of Ij(G) generated by Xj> . . . »2£ m ’ 

We denote the corresponding connected Lie subgroups by and 


A(£,T) = 


A (£) 


U 


O^tsT 


0 ^ t < 00 


S. We have: 
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LEMMA 3. 1. 

<*> ^toC h 525 S C S oC: S. 

(ii) L n is a subspace of L of codimension zero or one, 

^ U * - — ' ■ ■ — — 

(iii) S „ is a normal subgroup of S . 

~ u ^ 

Proof , (i) and (ii) are trivial, (iii) follows from the fact that a 
connected Lie subgroup H of a connected Lie group K is a normal subgroup 
of K if and only if L(H) is an ideal of L(JK) (cf. [2], p, 124). 

We shall use the notation S n — • for the coset of S modulo S,. which 
contains exp (^ ' 

4 . E lementary Properties of the Attainable Sets . 

If (X,U) is a right- invariant control system on G, then the vector 

fields X n# . . . ,X belong to the Lie algebra of S . Therefore, we '•an 
— " — m ~ 

consider (X , U) as a right- invariant control system on S, and Lemma 2. 1 
will be valid if G is "eplaced by S. This gives 

LEMMA 4. 1. If (X, U) is a right-invariant system on G. then A(el 
is considered in S. 

_ w rss 

\ 

The following lemma states a similar result for the sets j\(e, t). 
LEMMA 4. 2. If (X,U) is a right-invariant system on G. then for 
each t^> 0 Aje, t) is contained in Sq -~. 

It would be easy to prove this lemma directly, but since this re- 
sult is included in that of Lemma 6. 1 we omit the proof. 

We next derive some el emenlary topological properties of the 
attainable sets. If T > 0, we will denote the set of all restrictions of 
elements of U to [0,T] by JJ(T). 

LEMMA 4. 3. Let (X, U) be a right- invariant control system on G. 
The mapping (u,_t) -*»rr(g, u,_t) from U(T) x fO.Tl into G is continuous for 
each jy a nd each T > 0. if U(T) is given the topology of weak convergence . 
The proof of this result appears in [10], and therefore we will omit it. ' ' 
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From this we obtain: 

LEMMA 4.4. Let (X.U) be a right-invariant control system on G. 
(i. The sets A(e,T), A(e), A(e, T) are path-connected, for each 

T >0. 

( ii) Jf U = U r then A (je , ^T) and A j., T) are compact . 

Proof, (i) will be an immediate consequence rf the fact that U(T) 
is path- connected and of Lemma 4. 3. The path-connectedness of 
U(T) is trivial in the unrestricted and in the restricted case. In the 
"bang-bang" case, let_u and v belong to U(TJ. 

For each t^ such that 0<^<T, let w^ be defined by 

w fc (t) = v(t) if 0< T < t_ 

w \t) = u( t ) if t_ < T < T 

Then w t e tJ(T). Moreover, w^ = u and w^ - _v. Since £ — w^ 
is a continuous path in U(T), it follows that U(T) is indeed p ith-con- 
nected. 

To prove (ii) we remark that, if IJ is the class of restricted 
controls, then U(T) is compact in the weak topology. The proof is 
now complete. 

In regard to the algebraic properties of the attainable sets we 



have the following: 
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hen the set A (e) is a semi-group. 

Proot . Let and belong to A (e) . Let £ = ti t) , j[' = 

TTje.ji'.P). Let the control v be defined by 


v(t) = u(t) for 0 < t < t_ 

v(t) = u'(t - t) for + V 

Then tt(^, _ v, t_+ V) = , and therefore, g_'g_ e A (e) . The proof 

is then complete. 

" T e cannot a ssert, in general, that A (e) is a group. However, 
the following theorem tells us that, if A(e) is a group, then ic must 
be the group S. 

THEOREM 4. 6. Let (X. U) be a right- inv ^- iant control system 
on O. If A(_e) if a subgroup of Q, then A(e) = S. 

Proo f. We know that A (e) is path- connected. If A(e) is a sub- 
group, it follows from Theorem 2. 2 that it is a Lie subgroup of 
G. Let A he its Lie algebra. Then Ad L, because A(e)CS 
(Lamma 4. 1). Or the other hand, let a 1 (^j» • . . » a m ) he an m- tuple 
such that each a is + 1. 
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Let u be the constant control u = (a. , .... a ). Then a e U and, 
— — — 1 • — m — — 

therefore, the curve t -* n (e,^j, t) (0 » ) is contained in A (e). 

In other words, if we let 


m 


X (a) = X A + 2— a . X. 

— ~ —0 • , — i — i 

i = l — — 


it follows that exp(t X(a)) belongs to A (e) for all t - 0. Since A (e) is a 

subgroup, this will be true for all real t.. Therefore, (cf, [4], p. 94), 

we can conclude that X(a) belongs to t \ . Since the elements X(a) 

form a system of generators of L, we conclude that L C A and, therefore, 

/■«*/ 

L = A and A (e) = S. 

5. The Homogeneous Case 

A right- invarient control system (X,U) is homogeneous if X rt = 0, 

a/ — u 


As an introduction to the general case, we consider these systems first. 

The result stated in the next theorem appeared first in a study by 
R.W. Brockett [l]. 

THEOREM 5. 1. Let (X, U) be a homogeneous right- invariant 
control system on G. Then the set attainable from the identity is the 
subgroup S. Moreover, _if JLJ is unrestricted then, f or each T > 0 , 

A v e, T) = A(e) - S. 

Proof . To prove the first statement it is sufficient, in view of 
Theorem 4. 6, to show that A(e) is a subgroup. We know that^(e) is a 
semigroup. It remains to be shown that, if ^ e A^e), then 8 e A(e). 
Let n ( e_, _u ,jt) = g, where u e U , t^ * 0. Let 

v(_s) = - u( t_ - _s) for 0 s_8 ^ L 
v(s) = for > t. 
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Obviously, v e U. Let 


Then 


f( s) = TT (ji, U_, t_ - S) . 


f(s) = 2— V (S) X (f( 8 )) . 
Ml - - 


Therefore, Ms a solution of the evolution equation corresponding 
to the control v. By the. right- invariance we must have fj^} = s] h, 

where Ji = f_(0) = g. But f(t) = n (je,_u, 0) - e. Therefore, tt (e, v, t) = j»”*, 
and we have shown that g 1 e A(e). 

To prove the second statement, assume that U is unrestricted. Let 
g = tt (.e,.u, t) for some u c and ^ > 0. Let ^ > 0, and define a control 
v by 

vj t) = ( t_/js) u( T t/s) for 0 ^ T < od . 


An easy computation shows that rr(e. v. s) = g. We have therefore shown 
that A(£,_t)C. M£,_s). Similarly, A( e, s) e , t) . Thus, A( e, s) = A^e,_t) 
for all L jj such that 0 < t_, 0 <js such that 0 <t_, 0 <^. Our proof is then 
complete. 

Remark . The previous theorem implies that, for a homogeneous 
system 

(a) The attainable set A (e) is a subgroup of G. 

(V ** 

(b) The set A (e) is the same for the three classes of controls, so 

— * 

that, in particular, every ^ e that can be reached from the identity by 
means of an arbitrary control, can also be reached by means of a 
"bang-bang" control (possibly at a later time). 
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(c) If U «U , then every j; cG that can he reached from the identity 
can in fact be reached in an arbitrarily short time. 

We shall see later that neither (a), nor (b), nor <c) need be true in 
the non-homogeneous case. 

6. The general Case . 

Our subsequent study will be based on the following lemma. 

LEMMA 6. 1. Let (X , U) be a right-invariant control system on G. 

^ — ‘ r " ' " ' ' ■ “ ” " 1 ■ ■ — * 

T hen for each T > 0 , 

(i) A (e,T) is contained in S , and the interior of A (e,T) is dense 

' — 1 ■ - — *■■ — ■■ ■ ■■ ■ -« i i — — — - - 

(in the topology of S ) _in A(e_, T). 

T 

(ii) A(e,T) is contained in S-^ , and the interior of A (e,T) is dense 
(in the topology of S-^ ) in A^e, T). 

Proof. We shall use *he results of [11]. Our system is of the 
form considered in the Remark following Example 5. 2 of [11], with M=G, 
and with G acting on G by left translations. In the notations of [11], we 
have = R — , or 17 = C — (the unit cube in R , or Q = V— (the set of 
vertices of C— ) in the unrestricted, restricted and "bang-bang" cases, 
respectifely. In each of the three cases, the assumptions of [1 1 ] are 
satisfied, and an easy computation shows that^(D) = L and that «/ n (D) = L_. 

Since S is the integral manifold of L through e (cf. [2] p. 1C8), our first 

T» 

statement follows from [11]. Similarly, it is easy to verify that S r is 

-v 0 

T 

precisely the submanifold^— (D,jfi) of [11], and the second part of our 
lemma follows. 

In particular, it follows from Lemma 6. 1 that the interior of A(e) 

■ K 1 - _ ^ — • 

relative to S is nonempty. 


We shall also need the following: 
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LEMMA 6.2. Let H be a connected Lie group , and let L^ 

be elements of L (H) that genera te _L(H) . Then every h eH is a finite 
product of elements of the form e xp(t L.), where t is real and i = 

1» ... n. 

Proof . The set H' of all finite products of elements of the form 
exp(t^L^) is obviously a path- connected subgroup of H. Therefore, JH' 
is a connected Lie subgroup of H(cf. Theorem 2.2). Obviously, H/ 
contains the one-parameter subgroups generated by L^ f .. . , L^. Therefore, 
(cf. [4], p, 94) Lj, . . . , L^ belong to L(H'). Then, H* =Jd, and our 
proof is complete. 

LEMMA 6. 3. Let (X, U) be a right- invariant control system on G. 

If the set attainable from the identity is dense in S , then it is equal 
to S . 

Proof. Let g e A (e) belong to the interior of A(e) relative to S 
(cf. Lemma 6. 1). Let V cA(e) be relatively open in S and such that 
jg c V. Let W={h he V], Then W is a nonempty relatively open 
subset of S. Our assumption implies that W contains an element h of 
A(e); then the set V h (cf. footnote (2)) is re.atively open in S, and is 

— ** 'v 

contained in A(e). Moreover, V h contains the identity. Therefore, the 
semigroup A{e) contains a neighborhood of the identity in S . Since 
S is connected, we have that A (e;) = S, and our proof is complete. 
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LEMMA 6. 4. Let(X, U) be a ' .^ht- invariant control system on G 

with U = U . Then SCA(e) (the closure is taken relative to S). 

Proof . By Lemma 6. 2, every element of S is a product of elements 

of the form exp ( t_ X.) ( - o° < t_ < oo , j_ = l,... , m ). We show that exp( t^ X) 

belongs to A ( e) for every real^. and for every i_= 1 m. Since 

A(e) is a semigroup, this will imply that S C A(e), and the desired 

conclusion will follow immediately. 

Let t^ be a real number, and let 1 <^<_m. Let _u be the constant 

control (0, . . . , 0, n, 0, ... 0) where n appears in the j_-th position. Then 

u e U for each n > 0. We have 
— n — — 

tt (^,u .t/n) = exp(X Q + n X 
= exp((t^/n) Xq 

. I 

Letting n -*>» , we conclude that exp(t X. ) e A(e) , and our proof is com- 
— — — — 

plete. 

Remark . If U is not unres: *’cted, then^l need not be contained in 
A(e) (cf. Example 8. 4) , 

W » can now prove: 

THEOREM b. 5. Let be a right- invariant control system on 

G. Assume that the subgroup S is compact. Then 
(i) A(e) = S. 

(ii) There exist a T > 0 such that A (e, T) = A(e). 

Proof. Let H be the closure of A (e) relative to S . Ther H is a 

■ ■ ■■ ■ 1 0SS "** '*** 

semigroup. We show that is a group. Let h e H. Then, for every 
; . ive integer n, h— eH_. The sequence {h— } 


has a convergent 
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subsequence f h— ^ } , and we can assume that n(k) < n(k +1) for 

K A f • • * " 

all k.. Now, as jc -* oo, h = lim h ^— ~ * = lim h . Since £(j< + 1) - 
n(}c)-l is non- negative, it follows that h belongs to H for each k. Since 
H is closed, h ^ e H. Therefore, H is a group. Since A(jj) dL anc * 

A(e) has a non-empty interior relative to S, the same is true for H. Since 
H is a group and S is connected, we conclude that H = S. Therefore, 

A(e) is dense in S , and (i) follows from Lemma 6. 3. 

To prove (ii) we let W(t) denote, for each t^> 0, the interior, 

relative to S, of A(e,t). It is easy to see that the union of all the sets 

W(t) is^5(if j* fS , let j* e A(e, T); let _h be interior to A(_e,_T_’) and let 

h" 1 e A(e, T"); then g e W(T + T 1 + T"j). 

— * — * 

Since the sets W(_t) are increasing, it follows that W(t) = S for 
sufficiently large T, and our proof is complete. 

Remark . Theorem 6. 5 shows that, if S is compact, then conditions 
(a) and (b) of the remark following The r«-m 5. 1 are satisfied. However, 
in this case condition (c) need not be satisfied. Even if JJ is unrestricted, 
it may not be possible to reach every element of S in an arbitrarily small 
time (cf. Example 8. 1). 


If S is not compact, then A(e) need not be equal to S. The following 
theorem gives a sufficient condition under which A(e) = S; we do not know 
if this condition is also necessary. 



A(e) = S. 
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Proof . Let ja and {t^} satisfy the conditions of the theorem, and let 
lim tt(.£, Ui _t_ n ) = x • If X = 2£o + _u . ■ then, since u is constant, 

rr(j? ,_u,_t_) = exp(^_ X)* We first show that exp( t_ X) £ A (e) for every real 

number t_. If {^. n } is bounded, then there exists a positive number T 

such that exp(T^X) £ J>. Let _ -'e any real number, and le< n be a 

natural number withjiT^ + t_ > 0. Sit.ce S is a group we have that exp 

(-T X) £ S, and hence, exp(-TjiX) € S. By Lemma 6. 4, it follows 

that exp ( - T n X) e A (e). Since, obviously exp((jiT^ + t^)X) e A(e), we 

have that exp(t_X)= exp (-'T ji X) * exp ( ( -JT jn + t_) X) . and hence, exp(^X) 

eA(e). If { t ) is unbounded, let ft ) be a subsequence of {t } with 
— — k — 

t -t > k, and let t, = t -t We have that t, oo 

“2k + 1 -2k - i — 2 k + ! -2k i 

and exp t X — - ask — ®. Thus, for any real number t_, exp(t_ X) = 
lim exp((_t + )3£). 

k -oo - 

If k is sufficiently large, then ^ + r is positive. Therefore, exp 

Ev 

((t + r )X) belongs to A(e) for k large. It fellows that exp(t X) e A(e). 

By Lemma 6. 4, exp(t^ X- ) belongs to A^e) for every real and 

every i = 1, . . . , m. Since A (e) is a semigroup, it follows that every 

1 1 /%# “ 

product of elements of the form exp( t_ Y _ ) (t^ real, _Y e { X*2£j X }) 

belongs to A(e). Clearly, the elements X, X, X generate L. 

By Lemma 6. 2, A(e) = S. This completes the proof. 

***** 

The following corollary is immediate: 

COROLLARY 6.7. Let (X, u) be a right- invariant control system 
on G with IJ = U . If there exists a constant control u_ such that t_ — 
Tr(e,u,t) is periodic, then A(e) = S. 
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The following lemma gives a necessary condition for A(e) to be 

/NJ m 

equal to S; however, this condition is not sufficient (see Example 8. 3). 

LEMMA 6. 8. Let (X, U) be a right- invarian t control system oa G, 
and let A(e) = S. Then, there exists a non-zero number T such that exp 

“ * --w rws ' i i t t, j_ . . - « — • — * ^ T - i ' ' * r r * * ' ~ ■ r r » 4 

o 1» e £ 0 . 

Proof. Our assumption implies that exp(-X^) belongs to A(e,_t) for 
some t_ - 0. Therefore, by Lemma 4.2, exp( -X^) = exp( t_ X where 
g eS n . To complete the proof, take T = - 1 - t. 

There is one important case when Theorem 6. 6 and Lemma 6. 8 
yield a necessary and sufficient condition for A(e) = S , namely when 
S = Sq . This will happen if and only if L = L q. It is easy to check that 
this equality holds if and only if all the brackets [Xq,X.] belong to JL(i = l,. . . m). 

THEOREM 6. 9. Let (X, U) be a right-invariant control system on 
G with U = U . If L = L n , then a necessary and sufficient condition for 

— jj ~ 0 — ■ ■ 

A (e) to be equal to S is that there exist a number T , T £ 0 , such that 
exp( T Xq) belongs to S. 

Remark. The condition L = L n holds, in particular, when 
[X ,X ] = 0 . . ,m) i. e. , when exp(t^ X^) commutes with the elements 

of S. 


7. Controllability 

Let (X , U) be a right invariant control system on G, and let g £ G. 
We say that (X , U) is controllable from g if A(g) = G. We say that (X , U) 
is controllable if it is controllable from every e G. 

THEOREM 7. 1 A necessary condition for (X , U) to be controllable 
is that G be connected and that L = L(G)_. If G is compact, or if the 
system is homogeneous , t he condition is also sufficient. 

Proc f . The condition of the theorem holds if and only if G = S. 
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By Lemma 4. 1, the condition Is necessary. The second part of the state- 
ment follows from Theorems 5. 1 ar.u 6'. 5 (and from the obvious fact that, 
if A(e) = G , then A(g) = G for every g). 

In the compact case, we car* prove stronger controllability properties. 

THEOREM 7. 2. Let G be compact, and let (X , U) be controllable. 

***** r 1 r * 1 --------- - f 

Then there exists T > 0 such that, for every G, e G, there is a 
control that steers g into g 1 in less tha n T units of time. If G is semi- 
simple , then there exists T > 0 such that , for every £ e G, j|' e G, there 
is a control that steers £ into j*' in exactly T units of time. 

Proof . The first statement follows from Theorem h. 5 (ii). To 
prove the second statement, we observe that, if G_ is semisimple, then 
(X,U) has the "strong accessibility property", i. e. the set A{e,t) has a 
non-empty interior for every t^ > 0 (for a proof of this, see [ 1 1 ]) . From 
this fact the conclusion follows as in the proof of Theorem 6. 5 (ii). 

Finally, Theorem 6. 9 can also be interpreted as a controllability 
result. 

THEOREM 7. 3. Assume that the necessary conditions of Theorem 

7. 1 hold , and that (i) U = , and(ii) L = L q ( or , equivalently , L is an 

idaal of L). Then (X , U) is controllable if and only if exp (T X rt ) 
belongs to S ( =S for some T ^ 0. 

COROLLARY 7.4. If G is connected, L= L(G), U = U and X_ belongs 

— — - ^ — — — u l 0 

to the Lie algebra generated by X ^ , . . . ,_X m> then (X, U) is controllable . 

8. Examples 

In most of the following examples, we shall work with groups of 
matrices. Our groups will be Lie subgroups of G L (n,R), the group of 
all n x n non-singular real matrices. Recall that G L (_n, R) is an open 
subset of M (n,^.) (the set if all n x n real matrices). Since M(n,R) is 

— — — * /-w — . — ^ 
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a vector space, we can identify the tangent space to £ J^(q, R) at each 

point with . With this identification, a right-invariant vector field 

corresponds to a mapping X -* A X from G L (n,R) into M(n,R), where 

A is a fixed matrix. If X , , X are right-invariant vector fields, 

— — 0 — m 

given by X -*A^ X(i=0, . . . , m), then the evolution equation becomes 

X(t) =(A 0 + U;(t)A.)X(i) . 

Example 8. 1. Let = _S 0(3^, the set of all 3 x 3 real orthogonal 
matrices with positive determinant. The algebra L(G) is the set of all 
3x3 antisymmetric matrices. A basis for L(G) is given by the matrices 



Thus VG) is isomorphic to three-dimensional real space, with the Lie 
bracket corresponding to the vector product. Usmg this correspondence, 
it is obvious that, if A and J3 are any two linearly independent elements 
of L(G), then { A_, B, [A_, Bj} is a basis for L,(G). 

Let A and B be any linearly independent 3x3 anti-sym.- ati’ic 
matrices, and let our right- i nvariant control system on SO(3) be described 
by 

X(t) = (A +uB)X(t) 
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where u belongs to any class of admissible control**. Since SO { is com- 
pact and connected, Theorem 6 . 5 pplies, t.r. I cu** system is controllable. 
Moreover, there is a T >0 such that, given any two elements P, Q of 
SO( 3) there is a "bang-bang 1 control u that steers Pinto Q in less than 
T units of time In this connection, it is interesting to observe that, 
in general, there may not exist arbitrarily small numbers T with the 
above property, even if the control u ; s completely unrestricted , i'ake, 
for instance, A ~ K ^ nd B - If u is an arbitrary control, and if 

X(t) is the solution of the evolution equation corresponding to u with initial 

condition X(0) = I, write X = (x..) . . , 7 a . Then we have 

~ ~ “ 'ij y z 1 >*-■«-' 


* 1 2 


— 22 + ~ 


4 32 


and 


-32 


' -- 12 


Multiplying the first equation by x the second equation by x and 
addi^-'j, we get 

1 - , 2 2 
2 dF 12 + i 32 ' * — 22 — IT. 


2 2 

Since Xj 2 + 32 , van I s ^ e ' : at t_ = 0 , we have: 


(is 12 t * ( 1 ) ~ 2 


— 32 * Vid 


/A 

J 0 


22 (t) ^12 (t) - T 


Butx^fT) and x ^(t) are entries of orthogonal matrices. Hence, tney 
are bounded in absolute value by 1. Therefore, we conclude that 


2 2 

( X ^2 h X 22)^ Ki t. 
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This shows that a matrix (a j.) for which + — 32 = * cannot be reached 

from the identity in less than Jr units of time . 

Example 8. 2 . The considerations of the previous example can be 

generalized to G = J30(n). In this case the Lie algebra of G is the set 

of all n x ji anti- symmetric matrices. 

Let A = (a ..) and B = (b..) be matrices defined as follows: 

— - U “ -VJ 

a . . . , = 1 for i = 1,... , n - 2, a. . , = - 1 for i = 2, . . . , n- 1, a.. =0 other- 

— j_,ji_+l — — 1 ~ — —i2 

wise, and let b . =1, b , = - 1, b = 0 otherwise. It is easy to show 

n- * » »i n f n- a ^ 'Xl 

that the smallest subaigebr that contains and JB is exactly L(G) . 

1 

Thus, even though SO(n) is -g n(n- 1) - dimensional, the system X = (A + u_ B) 
X, in which only one control is involved, is controllable. Moreover, as 
before, we can limit u to be ’’bang-bang". An easy argument shows that 
this fact, which has been shown to be true for the particular matrices 
A and B defined above, is in fact true for "almost all" pairs (A, B) e L(G) 
xL(G). Precisely, the set of pairs (A , B) such that A and B generate 
^(G) is open and dense in L(G) x L(G). 

R emark . If G is an arbitrary connected Lie group such that _L(G) 
is generated by two elements, then Theorem 5. 1 enables us to conclude, 
in a way similar to that of the previous examples that ihe homogeneous 
system on G of the form 3£(f) = (ii A + J£ 5) 2£Q) * 8 controllable for 
"almost all" pairs (A , B) e L,(G) x L(G). This result holds even if we 
restrict ^ and v to be "bang-bang". 

The previous statement holds, in particular, when G = S Jj(n, R), 
the set of all n x jn real matrices whose determinant is 1, or when 
G = G L(n, R), the set of all n x n real matrices whose determinant is 
greater than 0. 


C. 
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Example 8. 3. We show that, if A., ^ generate Ij(G), and if Q.is 

* 

connected and not compact, then the system X = (A + u B)X need not be 
controllable, even if Ij(G) is a simple Lie algebra. In particular, this 
will show that the necessary condition of Lemma 6. 8 is not sufficient. 
Take G = S L(2,R). Let 


A = 


1 0 


0 -1 


B = 


It is clear that A and B generate L(G) and that L (G) is simple. Let^i be 

an arbitrary control, and let3£(t) be the solution of the evolution equation 

corresponding to^. with initial condition X(0) = I. Let X= (x..). . . - • 

~ 11 

ThenXjj = — j] ‘*‘ii521 an ^*21 = — — 11 ”—21* Multiplying the first 
equation by x^, the second one by and subtracting, we get 

1 - . 2 2.22 

1 (*„- *21> = 2 n +i 2 l 

2 2 

Thus, the function x ^(t) - x^jlt^) is non-decreasing for every trajectory 

of our system. Since its value for t^ = 0 is 1 , it follows that every 

element of S L (2,R) that can be reached from the identity in positive 

2 2 

time satisfies the inequality Xj j a x^j + 1. Hence, the system is not 
controllable. In the notations of Section 3, it is clear that L = I^(G). 

Thus, we have shown thatA(e) is not a group. However, L ft = L(G) 

(because L(G) is simple), and hence S ft = G. Therefore, exp(t A) belongs 
to S ft for all t > 0. This shows that the condition of Lemma 6. 8 is satisfied. 

Example 8. 4. In this example we show that Lemma 6. 4 and Theorem 
6. 6 need not be valid if U is not assumed to be unrestricted. Let G = 

R x S*, the product of the real line and the unit circle. Let Xq be the 
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generator of the one parameter groups -*• (t,e, — ), and let X be the 

generator of the one parameter group t^ (^, 1). Let tJ = IJ^ , or . 

Then A (e) - [0,co) xj>* f which is not a group. But if \x = 0, then rr(j5,^i, 1) = 
ex P X 0 =(1,1) which belongs to Thus, Theorem 6. 6 does not hold. As 
for Lemma 6. 4 it is clear that S and A(e) are closed, but S A(e). 

Example 8. 5 In view of Theorem 6. 6 it might seem that a necessary 
condition for a right- invariant system to be controllable is that exp £ Xq 
" gets arbitrarily close" to S for some non- zero values of t^ This example 
shows that such a statement is false. 

Let G = S L (2,R), and let 



Consider the system X = (A_ + B u) X where ju belongs to the class 
of unbounded controls. 

Let u be the constant control u = 1. Then the trajectory^ -*• n(i_, u,t) 
is the curve_t -*-e + — ^ , which is periodic with period 2p. By 

Corollary 6. 7, the system is controllab. e 

/ e-^ 0 \ / 1 -t t 

, and e T ~ = 


We have that 


t A 
e — 


\ 0 / 


\ 


- T 1+T 


t A 


Now, it is obvious that e— — stays away from S for all positive values of 


t. 
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FOOTNOTES 


1. A solution of (*) is an absolutely continuous G -valued function of the 
real variable t^ with the property that (*) is satisfied for almost every 
t_. 

2. If is a subset of G, andjg eG, we usejA^g to denote the set of all 
products a g, where a g A. 

3. The result is proved in [10] for groups of matrices, but the proof is 
valid for arbitrary Lie groups. Alternatively, one could use Ado's 
Theorem (cf. [4]) to go from the result of [1C] to a "local" version of 
Lemma 4. 3, and then deduce the general result. 



LIE ALGEBRAS AND LINEAR DIFFERENTIAL EQUATIONS 


Roger W. Brockett* 
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1 . * Differ e ntial Equati ons 

In this paper we study certain symmetry properties possessed by the 
solutions of linear differential equations. This is accomplished by use 
of some basic ideas from the theory of finite dimensional linear systems 

together with the work of Wei and Norman [1] on the use of Lie algebraic 

\ 

methods in differential equation theory. Our study is also strongly 

motivated by the results of reference [2] which provided a link between 

« • 

the present paper and a number of questions about the controllability of 

s * 

systems for which the control enters mul tiplicatively. 

Let ]R nxm denote the set of real n by m matrices. By a Lie 
algebra in K nxn we understand a subset of lR nxn which is a real 
vector space having the property that if A and B belong to X then 
so does [A,B] - AB-BA. Given an arbitrary subset of K nxn we 
denote by the smallest Lie algebra which contains «AA We denote 

the identity matrix by I and introduce the square matrix 


*This work was supported in part by the U.S. Office of Naval Re- 
search under the Joint Services Electronics Program by Contract NOOOlA- 
67-A-0298-0006 and by the National Aeronautics and Space Administration 
under Grant NGR 22-007-172. 

**This work was supported by the National Science Foundation Grant 
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J = 



I 

0 


Letting prime denote transpose, we say that a matrix A is Hamiltonian 
if 

JA = (JA) . 


We call a matrix P symplectic if P'JP - J. 

Wei and Norman [1] have observed that for |t| small the solution 
of differential equations of the type 

X(t) * ( l a,(t)A,)x(t) 

1=1 1 1 


can be expressed as 

x(t) = e H,9l(t) e Hj9j(t) ••• e HM9w(t) x(0) 


where > is a basis for the Lie algebra of n by n matrices gen- 
erated by {A.j} and where the g^ satisfy a set of nonlinear differ 
ential equations. In this paper we investigate some aspects of this 
theory in the special case where v * 2. 


2. Lie Algebras and Rational Functions 

We begin by establishing tv/o results on the Lie algebra generated 
by a pair of n by n .matrices. 

In order to avoid undue repetition let us agree to call a matrix 
of rational functions G(s) regular if it is square and approaches zero 
as | s | approached infinity. Our first point is that it is possible to 
associate a Lie algebra with each regular matrix of rational functions 
in a natural way. This correspondence goes as follows. It is well 
known [3, 4] that every regular matrix of rational functions can be 
expressed as 

G(s) = C(Is - A)' 1 B 

with C € ]R m n , A £ IR n n , B £ ]R n m . Moreover it is always possible 
to pick A, B and C such that 

rank(B, AB, A n "^B) s rank(C; CA; ...; t CA n “^) s n 
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whm* ( , ) denotes a column partition and ( ; ) a row repetition. 
,Jn this case we s<«ythal the triple [A,B,C] »s ail n i. I rea 1 i nation 

of G(s). Now minimal realizations are not uniquely determined by G(s), 
but if [A,B,C] and [F,G,H] are two minimal realizations then there 
exists a unique nonsingular P such that PAP”^ * F, PB = G and 

CP"^ = H. This result first stated by Kalnian [3], is known in system 

theory as the state space isomorphism theorem (see e.g. [5] for an 

introductory account in the present notation). We now come to the Lie 

algebra. Given a regular matrix G(s) we find a minimal realization 
[A,B,C], and construct {A,BC}^, the Lie algebra of n by n matrices 
generated by A and BC. This collection depends o r the particular 
realization but if [F,G,H] is a second minimal realization of G(s) 
then F = PAP"^ and GH * PBCP"^ so that the Lie algebras are isomor- 
phic. That is (A,BC)^ and {F,GH}^ are matrix representations of 
the same abstract Lie algebra. We call this abstract algebra the Lie 
alge bra assoc iat ed with G(s). This Lie albegra reflect the symmetry 
properties of G(s) as the following theorems make clear. 

THEOREM 1 . Let A, B and C belong to lR nxn t lR nxm and R mxn re- 
spectively. Suppose [A,B,C] is a minimal realization of G(s) and 
suppose B and C are of rank m. Then: . 

i) There exists a nonsingular matrix P such that PAP”^ and 
PBCP"^ are both Hamiltonian if and only if there extsts a nonsingular 
symmetric matrix T *such that 

TG(s) * G'(-s)T . 

ii) There exists a nonsingular P such that PAP'^ and PBCP*^ 
are both skew-symmetric If and only if there exists a nonsingular skew- 
symmetric T such that TG(s) * G'(-s)T and 

«<*> * 1 r i xr « r i * r ; * ° • 

• * I S * A,| . 

Proo f. (Hamiltonian Case) Suppose that A and BC are Hamiltonian. 
Then we have JBC * C'B'J' and in view of the rank conditions JB s C'T 
for T e B'J'C* (CC‘ )“1 . Note T is nonsingular. Clearly C'TC s C'T 'C 
so T is symmetric. Thus (recall 0 ^ * -I) 
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TC'(s) - TC(Is - A)” 1 B 

« TCJ'(Is - JAJ' )~^ JB 

* B' <-Is - A' )“ 1 C , T 
. * 6' (-s)T . 

On the other hand, suppose that for some symmetric nonsingular T 
we have TG(s) * G'(-s)T. Thus 

TC( Is - A) -1 B * B* (-Is - A" )" 1 C , T 
* -B' (Is +A')" 1 C'T . 

Since both sides are minimal realizations it follows from the state 
space isomorphism theorem referred to above, that there exists a non- 
singular matrix P such that 

% 

PAP' 1 * -A’ ; PB * C’T ; TCP' 1 - -B‘ ; 

thus upon transposition and rearrangement we get 

P'AP 1 ' 1 * -A' ; P'B * -C'T ; CP,*" 1 * -B' . 

Now by uniqueness of P (compare with [4,6]) we see that P * -P'. 
Thus there exists a nonsingular Q such that Q'JQ *,P# Finally we 
see that [QAQ - ^ , QB,,CQ”*] is a realization such that QAQ“* and 
QBCQ“^ are Hamiltonian. See references [4] and [6] for additional 
insight into arguments of this type. 

♦ 

(Skew-Symmetric Case) Suppose that A and BC are skew-symmet- 
ric. Then we have BC * -C'B' and in view of the rank conditions 
B * C'T for T * (B'C'HCC 1 )" 1 . Note T is nonsingular. Clearly 
C'TC * -C'T'C and so T Is skew-symmetric. Thus 

TG(s) * TC( Is - A)*^ 

■ B 1 (Is - A) _1 B 

■ B* ( Is +A')' 1 B 

* B‘(-Is - A’r'CT 

* G* (-s)T . 
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All zeros of eh- L ( T ■.? •- A}" are on the imaginar • i i •“•'re*'' A * -A‘. 

The partial fraction expansion of (I$-A)~‘ has only terms of multi- 
plicity one since A is normal. Clearly the residues of IG(s) a 
TC( Is - A)"'C'T‘ at these poles are symmetric and nonnegative definite. 

On the other hand, suppose that for some nonsingul ar skew- symmetric 
T we have TG(s) - -G' (-s)T , with TG(s) given by the partial fraction 
expansion displayed in the theorem statement. Expand each as the 
sum of dyads and renumber (if necessary) the X's so that 


TG(sj » £ b.b! 

<=1 11 s z + 

with each b^ being an m by m vector. Now let A, B, and C be 
given by 



1 

o 

>* 

o 

o 
I 


r>n 

£ 

o 

o 

o 


0 

0 0 0 Xg 

i B * 

cr 

fN) * 

0 0 -X 2 0 


0 

• 



• 

• 


TC = [ b 1 0 b 2 0 •••] . 


Then TCfls-A^B * TG(s), A = -A‘ and BC * -C'B'. (Compare with 
Theorem 2 of [7] from which one can see a relationship .between this 
result and the structure of lossless electrical networks.) 1 


We now characterize the conditions under which the representation 
of the Lie algebras obtained this way are irreducible. We call a set 
of matrices (Aj , A 2 , ...» A n > irreducible if there exists no nonsin- 
gular P such that all whe PA^P"} are In block triangular form: 

T F i 6 il 

PAjP * ; F* * v by v ; H* * y by y . 

Lo Hj 

We recall the matrix form of Schur's lemma which says that a set 
of n by n matrices are Irreducible if and only if there exists no non- 
clngular matrix which Is not a scalar multiple of the identity and which 
commutes with all the matrices in the set. 
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THfOUFri Let G(s) bo a given regular -atnx of rational functions 
and let [A,B,CJ be a minimal realization of G($). Let A belong to 
Jt n * r » and let B and C belong to R nxm and F"*" respectively. 
Suppose B and C are of rank m. Then the Lie algebra {A»BC}^ is 
irreducible if and only if the set of m by ■ matrices G(C) is irre- 
ducible (C is the field of complex numbers and G(C) is its image 
under G(*)). 

Proof. Suppose that TG(s) = G(s)T for some constant matrix T which 
is invertible and not a multiple of the identity. Let [A,B,C] be a 
minimal realization of G(s). Then since 

Cds-A)* 1 # - r’cds-AJ^BT 

we see that [A,BT,T _1 C] is also a minimal realization of G(s). By 
the state space isomorphism theorem we know there exists P such that 

re * bt 
cp' 1 * r’c 
PAP' 1 * a . 

ft 

Since B is of full rank P cannot be a multiple of the Identity if 
T is not. 

On the other hand, if (A,BC> is reducible then there exists a 
tionsingular P, unequal to a multiple of "the identity, such that 
PAP~* * A and PBCP~* * BC. However, since B is one to one and C 
is onto, it follows that PB s BT for some nonsingular T and CP”* * 
RC for some nonsingular R. Thus BTRC • BC and since B and C 
have left and right inverses respectively, we see that T « and 
thus TG($) * G(s)T. Now P is not a multiple of the identity, and 
so PB f aB (this would violate uniqueness of P in the state space 
isomorphism theorem). Thus T is not a multiple of the identity and 
6(C) is reducible. * 

He note that in particular, if BC is a dyad and rank(B, AB, 

..., A n ”*B) a rank(C; CA; CA n "*) * n, then the representation 
(A,BC)a is irreducible and it is equivalent to a Hamiltonian algebra 
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if end only if n(s) - g(-s). In particular, the algebra associated 
yith l/s n depends bn whether n is even .or odd. It has been shown 
by direct construction in [8] that it is the full n(n+l)/2 dimensional 
Hamiltonian algebra if n is even. The Lie algebra associated with 
s n v(s n +l) is the full n^-1 dimensional algebra of zero trace mat- * 
rices for every n (see [8]). We observe that to generate skew-sym- 
metric algebras we can use a G(s) of the form 


p n + i r 

1 S 1 

P -1 J ,2+i L 

-S lj 


+ •< 


1 


s y 


for the odd dimensional case and 


G„(s) 


for n even. 


j_ p s i + -j_r ? s i 

2+1 L-s lj $2+4 L-S i J 


>• + 


s 2 +n* 


r L; a 


3. An Application to Stability 

As is well known, the symplectic matrices form a group and the 
eigenvalues of symplectic matrices occur in reciprocal pairs. That Is 
say, if X is an eigenvalue of a symplectic matrix then so is 1/X. 
This observation together with the basic ideas of Floquet theory en- 
ables one to show that for 0 < t < • all solutions of. . 

x(t) « (A(t) + cB(t))x(t) A(t+T) * A(t) j S(t"+T) - B(t) 

# • 

are bounded for e sufficiently small provided A(t) and B(t) are 
Hamiltonian and the solution of the equation 

*(t) * A(t)x(t) 

has distinct characteristic multipliers, all lying on the unit circle 
(see reference [9]). This together with Theorem 1 yields the follow- 
ing theorem. 

THEOREM 3. Let p and q be polynomials with the degree of p larger 
than that of q. Suppose k(t) is periodic with period T. Then there 
exists e > 0 such that for |k(t)|<e all solutions of 
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p(D)x(t) + k(t)q(D)x(t) = 0 ; D = 

* 

are bounded provided i) q(s)/p(s> = q(-s)/p(-s) where ii) p(s) - 
(s 2 +x|)(s 2 +x|)...(s 2 +X 2 ) with Xj all real and nonzero mod 2v/T 
with (Xj-Xj) nonzero mod 2v/T. 

Proof. Under the given hypothesis there exists a realization of 
[A,B, C] of q(s)/p(s) such that A and BC are Hamiltonian. Thus 
we can express the evolution equations in first order form as 

x(t) = (A + k(t)D)x(t) 

AT 

with A and D Hamiltonian. By hypothesis e has all its eigen- 

values on the unit circle, and none are repeated. Thus by the pertur- 
bation result quoted, there exists e > 0 such that if |k(t}| < e 
for all t and k(t+T) = k(t) then we have stability. 

* 
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